Warning: Permanently added '52.201.9.59' (ED25519) to the list of known hosts. You can reproduce this build on your computer by running: sudo dnf install copr-rpmbuild /usr/bin/copr-rpmbuild --verbose --drop-resultdir --task-url https://copr.fedorainfracloud.org/backend/get-build-task/8324975-fedora-39-aarch64 --chroot fedora-39-aarch64 Version: 1.2 PID: 9273 Logging PID: 9274 Task: {'allow_user_ssh': False, 'appstream': False, 'background': False, 'build_id': 8324975, 'buildroot_pkgs': [], 'chroot': 'fedora-39-aarch64', 'enable_net': True, 'fedora_review': False, 'git_hash': '67f394ffca7a2f91abfbee4b994b11590d3f91d8', 'git_repo': 'https://copr-dist-git.fedorainfracloud.org/git/rezso/ML/cutlass', 'isolation': 'default', 'memory_reqs': 2048, 'package_name': 'cutlass', 'package_version': '3.6.0-20241009.0.gitcc3c29a8.cu12_6', 'project_dirname': 'ML', 'project_name': 'ML', 'project_owner': 'rezso', 'repo_priority': None, 'repos': [{'baseurl': 'https://download.copr.fedorainfracloud.org/results/rezso/ML/fedora-39-aarch64/', 'id': 'copr_base', 'name': 'Copr repository', 'priority': None}, {'baseurl': 'https://download.copr.fedorainfracloud.org/results/rezso/CUDA/fedora-39-aarch64/', 'id': 'copr_rezso_CUDA', 'name': 'Additional repo copr_rezso_CUDA'}, {'baseurl': 'http://developer.download.nvidia.com/compute/cuda/repos/rhel9/x86_64', 'id': 'http_developer_download_nvidia_com_compute_cuda_repos_rhel9_x86_64', 'name': 'Additional repo http_developer_download_nvidia_com_compute_cuda_repos_rhel9_x86_64'}, {'baseurl': 'http://developer.download.nvidia.com/compute/cuda/repos/rhel9/sbsa', 'id': 'http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa', 'name': 'Additional repo http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa'}], 'sandbox': 'rezso/ML--rezso', 'source_json': {}, 'source_type': None, 'ssh_public_keys': None, 'storage': None, 'submitter': 'rezso', 'tags': [], 'task_id': '8324975-fedora-39-aarch64', 'timeout': 172800, 'uses_devel_repo': False, 'with_opts': [], 'without_opts': []} Running: git clone https://copr-dist-git.fedorainfracloud.org/git/rezso/ML/cutlass /var/lib/copr-rpmbuild/workspace/workdir-gtryy35o/cutlass --depth 500 --no-single-branch --recursive cmd: ['git', 'clone', 'https://copr-dist-git.fedorainfracloud.org/git/rezso/ML/cutlass', '/var/lib/copr-rpmbuild/workspace/workdir-gtryy35o/cutlass', '--depth', '500', '--no-single-branch', '--recursive'] cwd: . rc: 0 stdout: stderr: Cloning into '/var/lib/copr-rpmbuild/workspace/workdir-gtryy35o/cutlass'... Running: git checkout 67f394ffca7a2f91abfbee4b994b11590d3f91d8 -- cmd: ['git', 'checkout', '67f394ffca7a2f91abfbee4b994b11590d3f91d8', '--'] cwd: /var/lib/copr-rpmbuild/workspace/workdir-gtryy35o/cutlass rc: 0 stdout: stderr: Note: switching to '67f394ffca7a2f91abfbee4b994b11590d3f91d8'. You are in 'detached HEAD' state. You can look around, make experimental changes and commit them, and you can discard any commits you make in this state without impacting any branches by switching back to a branch. If you want to create a new branch to retain commits you create, you may do so (now or later) by using -c with the switch command. Example: git switch -c Or undo this operation with: git switch - Turn off this advice by setting config variable advice.detachedHead to false HEAD is now at 67f394f automatic import of cutlass Running: dist-git-client sources cmd: ['dist-git-client', 'sources'] cwd: /var/lib/copr-rpmbuild/workspace/workdir-gtryy35o/cutlass rc: 0 stdout: stderr: INFO: Reading stdout from command: git rev-parse --abbrev-ref HEAD INFO: Reading stdout from command: git rev-parse HEAD INFO: Reading sources specification file: sources /usr/bin/tail: /var/lib/copr-rpmbuild/main.log: file truncated Running (timeout=172800): unbuffer mock --spec /var/lib/copr-rpmbuild/workspace/workdir-gtryy35o/cutlass/cutlass.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-gtryy35o/cutlass --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1732846793.454290 -r /var/lib/copr-rpmbuild/results/configs/child.cfg INFO: mock.py version 5.9 starting (python version = 3.13.0, NVR = mock-5.9-1.fc41), args: /usr/libexec/mock/mock --spec /var/lib/copr-rpmbuild/workspace/workdir-gtryy35o/cutlass/cutlass.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-gtryy35o/cutlass --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1732846793.454290 -r /var/lib/copr-rpmbuild/results/configs/child.cfg Start(bootstrap): init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish(bootstrap): init plugins Start: init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish: init plugins INFO: Signal handler active Start: run INFO: Start(/var/lib/copr-rpmbuild/workspace/workdir-gtryy35o/cutlass/cutlass.spec) Config(fedora-39-aarch64) Start: clean chroot Finish: clean chroot Mock Version: 5.9 INFO: Mock Version: 5.9 Start(bootstrap): chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-39-aarch64-bootstrap-1732846793.454290/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start(bootstrap): cleaning package manager metadata Finish(bootstrap): cleaning package manager metadata INFO: Guessed host environment type: unknown INFO: Using bootstrap image: registry.fedoraproject.org/fedora:39 INFO: Pulling image: registry.fedoraproject.org/fedora:39 INFO: Copy content of container registry.fedoraproject.org/fedora:39 to /var/lib/mock/fedora-39-aarch64-bootstrap-1732846793.454290/root INFO: Checking that registry.fedoraproject.org/fedora:39 image matches host's architecture INFO: mounting registry.fedoraproject.org/fedora:39 with podman image mount INFO: image registry.fedoraproject.org/fedora:39 as /var/lib/containers/storage/overlay/5acd9ba9825a2c2ec8c7534bebfea6f809d156320a167dae1c9ddbe095466a28/merged INFO: umounting image registry.fedoraproject.org/fedora:39 (/var/lib/containers/storage/overlay/5acd9ba9825a2c2ec8c7534bebfea6f809d156320a167dae1c9ddbe095466a28/merged) with podman image umount INFO: Package manager dnf4 detected and used (fallback) INFO: Bootstrap image not marked ready Start(bootstrap): installing dnf tooling No matches found for the following disable plugin patterns: local, spacewalk, versionlock Copr repository 12 MB/s | 978 kB 00:00 Additional repo copr_rezso_CUDA 1.0 MB/s | 72 kB 00:00 Additional repo http_developer_download_nvidia_ 158 MB/s | 2.2 MB 00:00 Additional repo http_developer_download_nvidia_ 137 MB/s | 1.7 MB 00:00 fedora 56 MB/s | 86 MB 00:01 updates 56 MB/s | 40 MB 00:00 Package python3-dnf-4.21.1-1.fc39.noarch is already installed. Dependencies resolved. ================================================================================ Package Arch Version Repository Size ================================================================================ Installing: python3-dnf-plugins-core noarch 4.9.0-1.fc39 updates 320 k Installing dependencies: dbus-libs aarch64 1:1.14.10-1.fc39 fedora 156 k python3-dateutil noarch 1:2.8.2-10.fc39 fedora 355 k python3-dbus aarch64 1.3.2-4.fc39 fedora 157 k python3-distro noarch 1.8.0-6.fc39 fedora 49 k python3-six noarch 1.16.0-12.fc39 fedora 41 k python3-systemd aarch64 235-5.fc39 fedora 107 k Transaction Summary ================================================================================ Install 7 Packages Total download size: 1.2 M Installed size: 4.8 M Downloading Packages: (1/7): dbus-libs-1.14.10-1.fc39.aarch64.rpm 7.5 MB/s | 156 kB 00:00 (2/7): python3-dateutil-2.8.2-10.fc39.noarch.rp 15 MB/s | 355 kB 00:00 (3/7): python3-distro-1.8.0-6.fc39.noarch.rpm 17 MB/s | 49 kB 00:00 (4/7): python3-dbus-1.3.2-4.fc39.aarch64.rpm 6.2 MB/s | 157 kB 00:00 (5/7): python3-six-1.16.0-12.fc39.noarch.rpm 14 MB/s | 41 kB 00:00 (6/7): python3-systemd-235-5.fc39.aarch64.rpm 31 MB/s | 107 kB 00:00 (7/7): python3-dnf-plugins-core-4.9.0-1.fc39.no 60 MB/s | 320 kB 00:00 -------------------------------------------------------------------------------- Total 2.8 MB/s | 1.2 MB 00:00 Running transaction check Transaction check succeeded. Running transaction test Transaction test succeeded. Running transaction Preparing : 1/1 Installing : python3-systemd-235-5.fc39.aarch64 1/7 Installing : python3-six-1.16.0-12.fc39.noarch 2/7 Installing : python3-dateutil-1:2.8.2-10.fc39.noarch 3/7 Installing : python3-distro-1.8.0-6.fc39.noarch 4/7 Installing : dbus-libs-1:1.14.10-1.fc39.aarch64 5/7 Installing : python3-dbus-1.3.2-4.fc39.aarch64 6/7 Installing : python3-dnf-plugins-core-4.9.0-1.fc39.noarch 7/7 Running scriptlet: python3-dnf-plugins-core-4.9.0-1.fc39.noarch 7/7 Verifying : dbus-libs-1:1.14.10-1.fc39.aarch64 1/7 Verifying : python3-dateutil-1:2.8.2-10.fc39.noarch 2/7 Verifying : python3-dbus-1.3.2-4.fc39.aarch64 3/7 Verifying : python3-distro-1.8.0-6.fc39.noarch 4/7 Verifying : python3-six-1.16.0-12.fc39.noarch 5/7 Verifying : python3-systemd-235-5.fc39.aarch64 6/7 Verifying : python3-dnf-plugins-core-4.9.0-1.fc39.noarch 7/7 Installed: dbus-libs-1:1.14.10-1.fc39.aarch64 python3-dateutil-1:2.8.2-10.fc39.noarch python3-dbus-1.3.2-4.fc39.aarch64 python3-distro-1.8.0-6.fc39.noarch python3-dnf-plugins-core-4.9.0-1.fc39.noarch python3-six-1.16.0-12.fc39.noarch python3-systemd-235-5.fc39.aarch64 Complete! Finish(bootstrap): installing dnf tooling Start(bootstrap): creating root cache Finish(bootstrap): creating root cache Finish(bootstrap): chroot init Start: chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-39-aarch64-1732846793.454290/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Package manager dnf4 detected and used (direct choice) INFO: Buildroot is handled by package management downloaded with a bootstrap image: rpm-4.19.1.1-1.fc39.aarch64 rpm-sequoia-1.7.0-1.fc39.aarch64 python3-dnf-4.21.1-1.fc39.noarch python3-dnf-plugins-core-4.9.0-1.fc39.noarch yum-4.21.1-1.fc39.noarch Start: installing minimal buildroot with dnf No matches found for the following disable plugin patterns: local, spacewalk, versionlock Copr repository 19 MB/s | 978 kB 00:00 Additional repo copr_rezso_CUDA 805 kB/s | 72 kB 00:00 Additional repo http_developer_download_nvidia_ 172 MB/s | 2.2 MB 00:00 Additional repo http_developer_download_nvidia_ 140 MB/s | 1.7 MB 00:00 fedora 49 MB/s | 86 MB 00:01 updates 58 MB/s | 40 MB 00:00 Dependencies resolved. ======================================================================================== Package Arch Version Repo Size ======================================================================================== Installing group/module packages: bash aarch64 5.2.26-1.fc39 updates 1.8 M bzip2 aarch64 1.0.8-16.fc39 fedora 52 k coreutils aarch64 9.3-7.fc39 updates 1.1 M cpio aarch64 2.14-4.fc39 fedora 277 k diffutils aarch64 3.10-3.fc39 fedora 396 k fedora-release-common noarch 39-36 updates 19 k findutils aarch64 1:4.9.0-6.fc39 updates 494 k gawk aarch64 5.2.2-2.fc39 fedora 1.1 M glibc-minimal-langpack aarch64 2.38-99.fc39 copr_base 67 k grep aarch64 3.11-3.fc39 fedora 295 k gzip aarch64 1.12-6.fc39 fedora 164 k info aarch64 7.0.3-3.fc39 fedora 179 k patch aarch64 2.7.6-22.fc39 fedora 123 k redhat-rpm-config noarch 266-1.fc39 updates 78 k rpm-build aarch64 4.19.1.1-1.fc39 updates 79 k sed aarch64 4.8-14.fc39 fedora 304 k shadow-utils aarch64 2:4.14.0-2.fc39 updates 1.3 M tar aarch64 2:1.35-2.fc39 fedora 854 k unzip aarch64 6.0-62.fc39 fedora 183 k util-linux aarch64 2.39.4-1.fc39 updates 1.2 M which aarch64 2.21-40.fc39 fedora 42 k xz aarch64 5.4.4-1.fc39 fedora 556 k Installing dependencies: alternatives aarch64 1.26-1.fc39 updates 38 k ansible-srpm-macros noarch 1-12.fc39 updates 21 k audit-libs aarch64 3.1.5-1.fc39 updates 124 k authselect aarch64 1.4.3-1.fc39 fedora 150 k authselect-libs aarch64 1.4.3-1.fc39 fedora 249 k basesystem noarch 11-18.fc39 fedora 7.2 k binutils aarch64 2.40-14.fc39 updates 6.1 M binutils-gold aarch64 2.40-14.fc39 updates 945 k bzip2-libs aarch64 1.0.8-16.fc39 fedora 43 k ca-certificates noarch 2024.2.69_v8.0.401-1.0.fc39 updates 871 k coreutils-common aarch64 9.3-7.fc39 updates 2.1 M cracklib aarch64 2.9.11-2.fc39 fedora 94 k crypto-policies noarch 20231204-1.git1e3a2e4.fc39 updates 100 k curl aarch64 8.2.1-5.fc39 updates 340 k cyrus-sasl-lib aarch64 2.1.28-11.fc39 fedora 781 k debugedit aarch64 5.0-12.fc39 updates 78 k dwz aarch64 0.15-3.fc39 fedora 136 k ed aarch64 1.19-4.fc39 fedora 78 k efi-srpm-macros noarch 5-9.fc39 fedora 22 k elfutils aarch64 0.192-4.fc39 updates 569 k elfutils-debuginfod-client aarch64 0.192-4.fc39 updates 43 k elfutils-default-yama-scope noarch 0.192-4.fc39 updates 12 k elfutils-libelf aarch64 0.192-4.fc39 updates 208 k elfutils-libs aarch64 0.192-4.fc39 updates 266 k fedora-gpg-keys noarch 39-2 updates 130 k fedora-release noarch 39-36 updates 8.6 k fedora-release-identity-basic noarch 39-36 updates 9.4 k fedora-repos noarch 39-2 updates 9.3 k file aarch64 5.44-5.fc39 fedora 49 k file-libs aarch64 5.44-5.fc39 fedora 729 k filesystem aarch64 3.18-6.fc39 fedora 1.1 M fonts-srpm-macros noarch 1:2.0.5-12.fc39 fedora 26 k forge-srpm-macros noarch 0.3.1-1.fc39 updates 19 k fpc-srpm-macros noarch 1.3-8.fc39 fedora 7.4 k gdb-minimal aarch64 15.1-1.fc39 updates 3.9 M gdbm-libs aarch64 1:1.23-4.fc39 fedora 56 k ghc-srpm-macros noarch 1.6.1-2.fc39 fedora 7.8 k glibc aarch64 2.38-99.fc39 copr_base 1.7 M glibc-common aarch64 2.38-99.fc39 copr_base 338 k glibc-gconv-extra aarch64 2.38-99.fc39 copr_base 1.9 M gmp aarch64 1:6.2.1-5.fc39 fedora 266 k gnat-srpm-macros noarch 6-3.fc39 fedora 8.8 k go-srpm-macros noarch 3.5.0-1.fc39 updates 28 k jansson aarch64 2.13.1-7.fc39 fedora 46 k json-c aarch64 0.17-1.fc39 fedora 44 k kernel-srpm-macros noarch 1.0-20.fc39 fedora 10 k keyutils-libs aarch64 1.6.3-1.fc39 updates 32 k krb5-libs aarch64 1.21.3-2.fc39 updates 771 k libacl aarch64 2.3.1-9.fc39 updates 24 k libarchive aarch64 3.7.1-3.fc39 updates 401 k libattr aarch64 2.5.1-8.fc39 fedora 18 k libblkid aarch64 2.39.4-1.fc39 updates 116 k libbrotli aarch64 1.1.0-1.fc39 fedora 345 k libcap aarch64 2.48-9.fc39 updates 69 k libcap-ng aarch64 0.8.3-8.fc39 fedora 32 k libcom_err aarch64 1.47.0-2.fc39 fedora 26 k libcurl aarch64 8.2.1-5.fc39 updates 316 k libdb aarch64 5.3.28-56.fc39 fedora 735 k libeconf aarch64 0.5.2-2.fc39 updates 30 k libevent aarch64 2.1.12-9.fc39 fedora 254 k libfdisk aarch64 2.39.4-1.fc39 updates 157 k libffi aarch64 3.4.4-4.fc39 fedora 38 k libgcc aarch64 13.3.1-3.fc39 updates 104 k libgomp aarch64 13.3.1-3.fc39 updates 321 k libidn2 aarch64 2.3.7-1.fc39 updates 120 k libmount aarch64 2.39.4-1.fc39 updates 153 k libnghttp2 aarch64 1.55.1-5.fc39 updates 76 k libnsl2 aarch64 2.0.0-6.fc39 fedora 30 k libpkgconf aarch64 1.9.5-2.fc39 fedora 38 k libpsl aarch64 0.21.2-4.fc39 fedora 63 k libpwquality aarch64 1.4.5-6.fc39 fedora 120 k libselinux aarch64 3.5-5.fc39 fedora 86 k libsemanage aarch64 3.5-4.fc39 fedora 117 k libsepol aarch64 3.5-2.fc39 fedora 311 k libsigsegv aarch64 2.14-5.fc39 fedora 27 k libsmartcols aarch64 2.39.4-1.fc39 updates 65 k libssh aarch64 0.10.6-2.fc39 updates 213 k libssh-config noarch 0.10.6-2.fc39 updates 9.0 k libstdc++ aarch64 13.3.1-3.fc39 updates 819 k libtasn1 aarch64 4.19.0-3.fc39 fedora 73 k libtirpc aarch64 1.3.6-0.fc39 updates 95 k libtool-ltdl aarch64 2.4.7-7.fc39 fedora 36 k libunistring aarch64 1.1-5.fc39 fedora 540 k libutempter aarch64 1.2.1-10.fc39 fedora 27 k libuuid aarch64 2.39.4-1.fc39 updates 28 k libverto aarch64 0.3.2-6.fc39 fedora 21 k libxcrypt aarch64 4.4.36-2.fc39 fedora 123 k libxml2 aarch64 2.10.4-3.fc39 fedora 689 k libzstd aarch64 1.5.6-1.fc39 updates 284 k lua-libs aarch64 5.4.6-3.fc39 fedora 131 k lua-srpm-macros noarch 1-13.fc39 updates 8.7 k lz4-libs aarch64 1.9.4-4.fc39 fedora 68 k mpfr aarch64 4.2.0-3.fc39 fedora 319 k ncurses-base noarch 6.4-7.20230520.fc39.1 updates 88 k ncurses-libs aarch64 6.4-7.20230520.fc39.1 updates 326 k ocaml-srpm-macros noarch 8-2.fc39 fedora 14 k openblas-srpm-macros noarch 2-14.fc39 fedora 7.5 k openldap aarch64 2.6.7-1.fc39 updates 250 k openssl-libs aarch64 1:3.1.4-4.fc39 updates 2.0 M p11-kit aarch64 0.25.5-1.fc39 updates 495 k p11-kit-trust aarch64 0.25.5-1.fc39 updates 138 k package-notes-srpm-macros noarch 0.5-9.fc39 fedora 11 k pam aarch64 1.5.3-3.fc39 updates 552 k pam-libs aarch64 1.5.3-3.fc39 updates 57 k pcre2 aarch64 10.42-1.fc39.2 fedora 219 k pcre2-syntax noarch 10.42-1.fc39.2 fedora 143 k perl-srpm-macros noarch 1-51.fc39 fedora 8.0 k pkgconf aarch64 1.9.5-2.fc39 fedora 42 k pkgconf-m4 noarch 1.9.5-2.fc39 fedora 14 k pkgconf-pkg-config aarch64 1.9.5-2.fc39 fedora 9.6 k popt aarch64 1.19-3.fc39 fedora 66 k publicsuffix-list-dafsa noarch 20240107-1.fc39 updates 58 k pyproject-srpm-macros noarch 1.16.0-1.fc39 updates 14 k python-srpm-macros noarch 3.12-8.fc39 updates 23 k qt5-srpm-macros noarch 5.15.14-2.fc39 updates 8.9 k qt6-srpm-macros noarch 6.6.2-1.fc39 updates 8.9 k readline aarch64 8.2-6.fc39 updates 212 k rpm aarch64 4.19.1.1-1.fc39 updates 536 k rpm-build-libs aarch64 4.19.1.1-1.fc39 updates 91 k rpm-libs aarch64 4.19.1.1-1.fc39 updates 305 k rpm-sequoia aarch64 1.7.0-1.fc39 updates 868 k rpmautospec-rpm-macros noarch 0.7.3-1.fc39 updates 11 k rust-srpm-macros noarch 26.3-1.fc39 updates 13 k setup noarch 2.14.4-1.fc39 fedora 154 k sqlite-libs aarch64 3.42.0-7.fc39 fedora 677 k systemd-libs aarch64 254.20-1.fc39 updates 660 k util-linux-core aarch64 2.39.4-1.fc39 updates 505 k xxhash-libs aarch64 0.8.2-4.fc39 updates 34 k xz-libs aarch64 5.4.4-1.fc39 fedora 106 k zip aarch64 3.0-39.fc39 fedora 262 k zlib aarch64 1.2.13-4.fc39 fedora 93 k zstd aarch64 1.5.6-1.fc39 updates 445 k Installing Groups: Buildsystem building group Transaction Summary ======================================================================================== Install 154 Packages Total download size: 52 M Installed size: 303 M Downloading Packages: (1/154): glibc-gconv-extra-2.38-99.fc39.aarch64 36 MB/s | 1.9 MB 00:00 (2/154): glibc-common-2.38-99.fc39.aarch64.rpm 5.8 MB/s | 338 kB 00:00 (3/154): glibc-2.38-99.fc39.aarch64.rpm 26 MB/s | 1.7 MB 00:00 (4/154): authselect-1.4.3-1.fc39.aarch64.rpm 13 MB/s | 150 kB 00:00 (5/154): authselect-libs-1.4.3-1.fc39.aarch64.r 70 MB/s | 249 kB 00:00 (6/154): basesystem-11-18.fc39.noarch.rpm 3.5 MB/s | 7.2 kB 00:00 (7/154): bzip2-1.0.8-16.fc39.aarch64.rpm 24 MB/s | 52 kB 00:00 (8/154): bzip2-libs-1.0.8-16.fc39.aarch64.rpm 18 MB/s | 43 kB 00:00 (9/154): glibc-minimal-langpack-2.38-99.fc39.aa 3.2 MB/s | 67 kB 00:00 (10/154): cpio-2.14-4.fc39.aarch64.rpm 75 MB/s | 277 kB 00:00 (11/154): cracklib-2.9.11-2.fc39.aarch64.rpm 27 MB/s | 94 kB 00:00 (12/154): cyrus-sasl-lib-2.1.28-11.fc39.aarch64 93 MB/s | 781 kB 00:00 (13/154): dwz-0.15-3.fc39.aarch64.rpm 20 MB/s | 136 kB 00:00 (14/154): diffutils-3.10-3.fc39.aarch64.rpm 49 MB/s | 396 kB 00:00 (15/154): ed-1.19-4.fc39.aarch64.rpm 24 MB/s | 78 kB 00:00 (16/154): efi-srpm-macros-5-9.fc39.noarch.rpm 8.6 MB/s | 22 kB 00:00 (17/154): file-5.44-5.fc39.aarch64.rpm 22 MB/s | 49 kB 00:00 (18/154): file-libs-5.44-5.fc39.aarch64.rpm 141 MB/s | 729 kB 00:00 (19/154): fonts-srpm-macros-2.0.5-12.fc39.noarc 5.6 MB/s | 26 kB 00:00 (20/154): filesystem-3.18-6.fc39.aarch64.rpm 125 MB/s | 1.1 MB 00:00 (21/154): fpc-srpm-macros-1.3-8.fc39.noarch.rpm 1.6 MB/s | 7.4 kB 00:00 (22/154): gdbm-libs-1.23-4.fc39.aarch64.rpm 18 MB/s | 56 kB 00:00 (23/154): ghc-srpm-macros-1.6.1-2.fc39.noarch.r 2.3 MB/s | 7.8 kB 00:00 (24/154): gnat-srpm-macros-6-3.fc39.noarch.rpm 3.9 MB/s | 8.8 kB 00:00 (25/154): gmp-6.2.1-5.fc39.aarch64.rpm 66 MB/s | 266 kB 00:00 (26/154): gawk-5.2.2-2.fc39.aarch64.rpm 80 MB/s | 1.1 MB 00:00 (27/154): grep-3.11-3.fc39.aarch64.rpm 70 MB/s | 295 kB 00:00 (28/154): gzip-1.12-6.fc39.aarch64.rpm 41 MB/s | 164 kB 00:00 (29/154): info-7.0.3-3.fc39.aarch64.rpm 28 MB/s | 179 kB 00:00 (30/154): jansson-2.13.1-7.fc39.aarch64.rpm 6.5 MB/s | 46 kB 00:00 (31/154): json-c-0.17-1.fc39.aarch64.rpm 7.2 MB/s | 44 kB 00:00 (32/154): kernel-srpm-macros-1.0-20.fc39.noarch 4.8 MB/s | 10 kB 00:00 (33/154): libattr-2.5.1-8.fc39.aarch64.rpm 10 MB/s | 18 kB 00:00 (34/154): libbrotli-1.1.0-1.fc39.aarch64.rpm 97 MB/s | 345 kB 00:00 (35/154): libcap-ng-0.8.3-8.fc39.aarch64.rpm 8.6 MB/s | 32 kB 00:00 (36/154): libcom_err-1.47.0-2.fc39.aarch64.rpm 7.5 MB/s | 26 kB 00:00 (37/154): libevent-2.1.12-9.fc39.aarch64.rpm 49 MB/s | 254 kB 00:00 (38/154): libdb-5.3.28-56.fc39.aarch64.rpm 100 MB/s | 735 kB 00:00 (39/154): libnsl2-2.0.0-6.fc39.aarch64.rpm 13 MB/s | 30 kB 00:00 (40/154): libpkgconf-1.9.5-2.fc39.aarch64.rpm 11 MB/s | 38 kB 00:00 (41/154): libpsl-0.21.2-4.fc39.aarch64.rpm 20 MB/s | 63 kB 00:00 (42/154): libselinux-3.5-5.fc39.aarch64.rpm 28 MB/s | 86 kB 00:00 (43/154): libpwquality-1.4.5-6.fc39.aarch64.rpm 23 MB/s | 120 kB 00:00 (44/154): libsemanage-3.5-4.fc39.aarch64.rpm 45 MB/s | 117 kB 00:00 (45/154): libffi-3.4.4-4.fc39.aarch64.rpm 2.1 MB/s | 38 kB 00:00 (46/154): libsepol-3.5-2.fc39.aarch64.rpm 49 MB/s | 311 kB 00:00 (47/154): libsigsegv-2.14-5.fc39.aarch64.rpm 4.6 MB/s | 27 kB 00:00 (48/154): libtasn1-4.19.0-3.fc39.aarch64.rpm 11 MB/s | 73 kB 00:00 (49/154): libtool-ltdl-2.4.7-7.fc39.aarch64.rpm 8.6 MB/s | 36 kB 00:00 (50/154): libutempter-1.2.1-10.fc39.aarch64.rpm 14 MB/s | 27 kB 00:00 (51/154): libunistring-1.1-5.fc39.aarch64.rpm 63 MB/s | 540 kB 00:00 (52/154): libverto-0.3.2-6.fc39.aarch64.rpm 3.6 MB/s | 21 kB 00:00 (53/154): libxcrypt-4.4.36-2.fc39.aarch64.rpm 17 MB/s | 123 kB 00:00 (54/154): lua-libs-5.4.6-3.fc39.aarch64.rpm 18 MB/s | 131 kB 00:00 (55/154): libxml2-2.10.4-3.fc39.aarch64.rpm 72 MB/s | 689 kB 00:00 (56/154): lz4-libs-1.9.4-4.fc39.aarch64.rpm 9.4 MB/s | 68 kB 00:00 (57/154): ocaml-srpm-macros-8-2.fc39.noarch.rpm 6.7 MB/s | 14 kB 00:00 (58/154): mpfr-4.2.0-3.fc39.aarch64.rpm 71 MB/s | 319 kB 00:00 (59/154): openblas-srpm-macros-2-14.fc39.noarch 2.6 MB/s | 7.5 kB 00:00 (60/154): package-notes-srpm-macros-0.5-9.fc39. 8.4 MB/s | 11 kB 00:00 (61/154): patch-2.7.6-22.fc39.aarch64.rpm 51 MB/s | 123 kB 00:00 (62/154): pcre2-10.42-1.fc39.2.aarch64.rpm 62 MB/s | 219 kB 00:00 (63/154): pcre2-syntax-10.42-1.fc39.2.noarch.rp 45 MB/s | 143 kB 00:00 (64/154): perl-srpm-macros-1-51.fc39.noarch.rpm 6.1 MB/s | 8.0 kB 00:00 (65/154): pkgconf-m4-1.9.5-2.fc39.noarch.rpm 7.9 MB/s | 14 kB 00:00 (66/154): pkgconf-1.9.5-2.fc39.aarch64.rpm 18 MB/s | 42 kB 00:00 (67/154): pkgconf-pkg-config-1.9.5-2.fc39.aarch 4.4 MB/s | 9.6 kB 00:00 (68/154): popt-1.19-3.fc39.aarch64.rpm 26 MB/s | 66 kB 00:00 (69/154): sed-4.8-14.fc39.aarch64.rpm 97 MB/s | 304 kB 00:00 (70/154): setup-2.14.4-1.fc39.noarch.rpm 44 MB/s | 154 kB 00:00 (71/154): sqlite-libs-3.42.0-7.fc39.aarch64.rpm 152 MB/s | 677 kB 00:00 (72/154): tar-1.35-2.fc39.aarch64.rpm 140 MB/s | 854 kB 00:00 (73/154): unzip-6.0-62.fc39.aarch64.rpm 31 MB/s | 183 kB 00:00 (74/154): which-2.21-40.fc39.aarch64.rpm 9.5 MB/s | 42 kB 00:00 (75/154): xz-libs-5.4.4-1.fc39.aarch64.rpm 26 MB/s | 106 kB 00:00 (76/154): xz-5.4.4-1.fc39.aarch64.rpm 66 MB/s | 556 kB 00:00 (77/154): zlib-1.2.13-4.fc39.aarch64.rpm 16 MB/s | 93 kB 00:00 (78/154): zip-3.0-39.fc39.aarch64.rpm 24 MB/s | 262 kB 00:00 (79/154): alternatives-1.26-1.fc39.aarch64.rpm 6.0 MB/s | 38 kB 00:00 (80/154): ansible-srpm-macros-1-12.fc39.noarch. 4.1 MB/s | 21 kB 00:00 (81/154): audit-libs-3.1.5-1.fc39.aarch64.rpm 35 MB/s | 124 kB 00:00 (82/154): bash-5.2.26-1.fc39.aarch64.rpm 157 MB/s | 1.8 MB 00:00 (83/154): binutils-gold-2.40-14.fc39.aarch64.rp 79 MB/s | 945 kB 00:00 (84/154): ca-certificates-2024.2.69_v8.0.401-1. 114 MB/s | 871 kB 00:00 (85/154): coreutils-9.3-7.fc39.aarch64.rpm 116 MB/s | 1.1 MB 00:00 (86/154): binutils-2.40-14.fc39.aarch64.rpm 186 MB/s | 6.1 MB 00:00 (87/154): crypto-policies-20231204-1.git1e3a2e4 11 MB/s | 100 kB 00:00 (88/154): coreutils-common-9.3-7.fc39.aarch64.r 132 MB/s | 2.1 MB 00:00 (89/154): debugedit-5.0-12.fc39.aarch64.rpm 20 MB/s | 78 kB 00:00 (90/154): curl-8.2.1-5.fc39.aarch64.rpm 53 MB/s | 340 kB 00:00 (91/154): elfutils-default-yama-scope-0.192-4.f 5.9 MB/s | 12 kB 00:00 (92/154): elfutils-0.192-4.fc39.aarch64.rpm 84 MB/s | 569 kB 00:00 (93/154): elfutils-debuginfod-client-0.192-4.fc 7.7 MB/s | 43 kB 00:00 (94/154): elfutils-libelf-0.192-4.fc39.aarch64. 67 MB/s | 208 kB 00:00 (95/154): fedora-gpg-keys-39-2.noarch.rpm 50 MB/s | 130 kB 00:00 (96/154): elfutils-libs-0.192-4.fc39.aarch64.rp 66 MB/s | 266 kB 00:00 (97/154): fedora-release-39-36.noarch.rpm 4.1 MB/s | 8.6 kB 00:00 (98/154): fedora-release-common-39-36.noarch.rp 11 MB/s | 19 kB 00:00 (99/154): fedora-release-identity-basic-39-36.n 5.1 MB/s | 9.4 kB 00:00 (100/154): fedora-repos-39-2.noarch.rpm 4.8 MB/s | 9.3 kB 00:00 (101/154): forge-srpm-macros-0.3.1-1.fc39.noarc 8.0 MB/s | 19 kB 00:00 (102/154): findutils-4.9.0-6.fc39.aarch64.rpm 81 MB/s | 494 kB 00:00 (103/154): go-srpm-macros-3.5.0-1.fc39.noarch.r 6.6 MB/s | 28 kB 00:00 (104/154): keyutils-libs-1.6.3-1.fc39.aarch64.r 9.2 MB/s | 32 kB 00:00 (105/154): krb5-libs-1.21.3-2.fc39.aarch64.rpm 119 MB/s | 771 kB 00:00 (106/154): libacl-2.3.1-9.fc39.aarch64.rpm 4.6 MB/s | 24 kB 00:00 (107/154): gdb-minimal-15.1-1.fc39.aarch64.rpm 188 MB/s | 3.9 MB 00:00 (108/154): libarchive-3.7.1-3.fc39.aarch64.rpm 45 MB/s | 401 kB 00:00 (109/154): libblkid-2.39.4-1.fc39.aarch64.rpm 14 MB/s | 116 kB 00:00 (110/154): libcap-2.48-9.fc39.aarch64.rpm 28 MB/s | 69 kB 00:00 (111/154): libeconf-0.5.2-2.fc39.aarch64.rpm 10 MB/s | 30 kB 00:00 (112/154): libcurl-8.2.1-5.fc39.aarch64.rpm 75 MB/s | 316 kB 00:00 (113/154): libfdisk-2.39.4-1.fc39.aarch64.rpm 46 MB/s | 157 kB 00:00 (114/154): libgcc-13.3.1-3.fc39.aarch64.rpm 46 MB/s | 104 kB 00:00 (115/154): libgomp-13.3.1-3.fc39.aarch64.rpm 91 MB/s | 321 kB 00:00 (116/154): libidn2-2.3.7-1.fc39.aarch64.rpm 32 MB/s | 120 kB 00:00 (117/154): libmount-2.39.4-1.fc39.aarch64.rpm 44 MB/s | 153 kB 00:00 (118/154): libnghttp2-1.55.1-5.fc39.aarch64.rpm 36 MB/s | 76 kB 00:00 (119/154): libsmartcols-2.39.4-1.fc39.aarch64.r 26 MB/s | 65 kB 00:00 (120/154): libssh-config-0.10.6-2.fc39.noarch.r 5.5 MB/s | 9.0 kB 00:00 (121/154): libssh-0.10.6-2.fc39.aarch64.rpm 51 MB/s | 213 kB 00:00 (122/154): libstdc++-13.3.1-3.fc39.aarch64.rpm 178 MB/s | 819 kB 00:00 (123/154): libtirpc-1.3.6-0.fc39.aarch64.rpm 21 MB/s | 95 kB 00:00 (124/154): libuuid-2.39.4-1.fc39.aarch64.rpm 10 MB/s | 28 kB 00:00 (125/154): lua-srpm-macros-1-13.fc39.noarch.rpm 5.8 MB/s | 8.7 kB 00:00 (126/154): libzstd-1.5.6-1.fc39.aarch64.rpm 83 MB/s | 284 kB 00:00 (127/154): ncurses-base-6.4-7.20230520.fc39.1.n 27 MB/s | 88 kB 00:00 (128/154): ncurses-libs-6.4-7.20230520.fc39.1.a 102 MB/s | 326 kB 00:00 (129/154): openldap-2.6.7-1.fc39.aarch64.rpm 75 MB/s | 250 kB 00:00 (130/154): p11-kit-0.25.5-1.fc39.aarch64.rpm 114 MB/s | 495 kB 00:00 (131/154): openssl-libs-3.1.4-4.fc39.aarch64.rp 205 MB/s | 2.0 MB 00:00 (132/154): p11-kit-trust-0.25.5-1.fc39.aarch64. 20 MB/s | 138 kB 00:00 (133/154): pam-1.5.3-3.fc39.aarch64.rpm 106 MB/s | 552 kB 00:00 (134/154): publicsuffix-list-dafsa-20240107-1.f 22 MB/s | 58 kB 00:00 (135/154): pam-libs-1.5.3-3.fc39.aarch64.rpm 16 MB/s | 57 kB 00:00 (136/154): pyproject-srpm-macros-1.16.0-1.fc39. 5.9 MB/s | 14 kB 00:00 (137/154): python-srpm-macros-3.12-8.fc39.noarc 11 MB/s | 23 kB 00:00 (138/154): qt5-srpm-macros-5.15.14-2.fc39.noarc 4.1 MB/s | 8.9 kB 00:00 (139/154): qt6-srpm-macros-6.6.2-1.fc39.noarch. 5.8 MB/s | 8.9 kB 00:00 (140/154): readline-8.2-6.fc39.aarch64.rpm 57 MB/s | 212 kB 00:00 (141/154): redhat-rpm-config-266-1.fc39.noarch. 19 MB/s | 78 kB 00:00 (142/154): rpm-build-libs-4.19.1.1-1.fc39.aarch 40 MB/s | 91 kB 00:00 (143/154): rpm-build-4.19.1.1-1.fc39.aarch64.rp 21 MB/s | 79 kB 00:00 (144/154): rpm-libs-4.19.1.1-1.fc39.aarch64.rpm 88 MB/s | 305 kB 00:00 (145/154): rpm-sequoia-1.7.0-1.fc39.aarch64.rpm 170 MB/s | 868 kB 00:00 (146/154): rpmautospec-rpm-macros-0.7.3-1.fc39. 4.4 MB/s | 11 kB 00:00 (147/154): rpm-4.19.1.1-1.fc39.aarch64.rpm 33 MB/s | 536 kB 00:00 (148/154): rust-srpm-macros-26.3-1.fc39.noarch. 2.9 MB/s | 13 kB 00:00 (149/154): shadow-utils-4.14.0-2.fc39.aarch64.r 103 MB/s | 1.3 MB 00:00 (150/154): systemd-libs-254.20-1.fc39.aarch64.r 56 MB/s | 660 kB 00:00 (151/154): util-linux-2.39.4-1.fc39.aarch64.rpm 77 MB/s | 1.2 MB 00:00 (152/154): xxhash-libs-0.8.2-4.fc39.aarch64.rpm 6.7 MB/s | 34 kB 00:00 (153/154): util-linux-core-2.39.4-1.fc39.aarch6 55 MB/s | 505 kB 00:00 (154/154): zstd-1.5.6-1.fc39.aarch64.rpm 80 MB/s | 445 kB 00:00 -------------------------------------------------------------------------------- Total 89 MB/s | 52 MB 00:00 fedora 1.6 MB/s | 1.6 kB 00:00 Importing GPG key 0x18B8E74C: Userid : "Fedora (39) " Fingerprint: E8F2 3996 F232 1864 0CB4 4CBE 75CF 5AC4 18B8 E74C From : /usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-39-primary Key imported successfully Running transaction check Transaction check succeeded. Running transaction test Transaction test succeeded. Running transaction Running scriptlet: filesystem-3.18-6.fc39.aarch64 1/1 Preparing : 1/1 Installing : libgcc-13.3.1-3.fc39.aarch64 1/154 Running scriptlet: libgcc-13.3.1-3.fc39.aarch64 1/154 Installing : crypto-policies-20231204-1.git1e3a2e4.fc39.noarc 2/154 Running scriptlet: crypto-policies-20231204-1.git1e3a2e4.fc39.noarc 2/154 Installing : fedora-release-identity-basic-39-36.noarch 3/154 Installing : fedora-gpg-keys-39-2.noarch 4/154 Installing : fedora-repos-39-2.noarch 5/154 Installing : fedora-release-common-39-36.noarch 6/154 Installing : fedora-release-39-36.noarch 7/154 Installing : setup-2.14.4-1.fc39.noarch 8/154 warning: /etc/hosts created as /etc/hosts.rpmnew Running scriptlet: setup-2.14.4-1.fc39.noarch 8/154 Installing : filesystem-3.18-6.fc39.aarch64 9/154 Installing : basesystem-11-18.fc39.noarch 10/154 Installing : rust-srpm-macros-26.3-1.fc39.noarch 11/154 Installing : qt6-srpm-macros-6.6.2-1.fc39.noarch 12/154 Installing : qt5-srpm-macros-5.15.14-2.fc39.noarch 13/154 Installing : publicsuffix-list-dafsa-20240107-1.fc39.noarch 14/154 Installing : ncurses-base-6.4-7.20230520.fc39.1.noarch 15/154 Installing : glibc-gconv-extra-2.38-99.fc39.aarch64 16/154 Running scriptlet: glibc-gconv-extra-2.38-99.fc39.aarch64 16/154 Installing : glibc-minimal-langpack-2.38-99.fc39.aarch64 17/154 Installing : glibc-common-2.38-99.fc39.aarch64 18/154 Running scriptlet: glibc-2.38-99.fc39.aarch64 19/154 Installing : glibc-2.38-99.fc39.aarch64 19/154 Running scriptlet: glibc-2.38-99.fc39.aarch64 19/154 Installing : ncurses-libs-6.4-7.20230520.fc39.1.aarch64 20/154 Installing : bash-5.2.26-1.fc39.aarch64 21/154 Running scriptlet: bash-5.2.26-1.fc39.aarch64 21/154 Installing : zlib-1.2.13-4.fc39.aarch64 22/154 Installing : xz-libs-5.4.4-1.fc39.aarch64 23/154 Installing : bzip2-libs-1.0.8-16.fc39.aarch64 24/154 Installing : popt-1.19-3.fc39.aarch64 25/154 Installing : libstdc++-13.3.1-3.fc39.aarch64 26/154 Installing : libuuid-2.39.4-1.fc39.aarch64 27/154 Installing : libzstd-1.5.6-1.fc39.aarch64 28/154 Installing : elfutils-libelf-0.192-4.fc39.aarch64 29/154 Installing : libblkid-2.39.4-1.fc39.aarch64 30/154 Installing : readline-8.2-6.fc39.aarch64 31/154 Installing : gmp-1:6.2.1-5.fc39.aarch64 32/154 Installing : libattr-2.5.1-8.fc39.aarch64 33/154 Installing : libacl-2.3.1-9.fc39.aarch64 34/154 Installing : libxcrypt-4.4.36-2.fc39.aarch64 35/154 Installing : libcap-2.48-9.fc39.aarch64 36/154 Installing : lz4-libs-1.9.4-4.fc39.aarch64 37/154 Installing : libeconf-0.5.2-2.fc39.aarch64 38/154 Installing : systemd-libs-254.20-1.fc39.aarch64 39/154 Installing : mpfr-4.2.0-3.fc39.aarch64 40/154 Installing : dwz-0.15-3.fc39.aarch64 41/154 Installing : unzip-6.0-62.fc39.aarch64 42/154 Installing : file-libs-5.44-5.fc39.aarch64 43/154 Installing : file-5.44-5.fc39.aarch64 44/154 Installing : jansson-2.13.1-7.fc39.aarch64 45/154 Installing : libcap-ng-0.8.3-8.fc39.aarch64 46/154 Installing : audit-libs-3.1.5-1.fc39.aarch64 47/154 Installing : pam-libs-1.5.3-3.fc39.aarch64 48/154 Installing : libcom_err-1.47.0-2.fc39.aarch64 49/154 Installing : libsepol-3.5-2.fc39.aarch64 50/154 Installing : libtasn1-4.19.0-3.fc39.aarch64 51/154 Installing : libunistring-1.1-5.fc39.aarch64 52/154 Installing : libidn2-2.3.7-1.fc39.aarch64 53/154 Installing : lua-libs-5.4.6-3.fc39.aarch64 54/154 Installing : alternatives-1.26-1.fc39.aarch64 55/154 Installing : libsmartcols-2.39.4-1.fc39.aarch64 56/154 Installing : libpsl-0.21.2-4.fc39.aarch64 57/154 Installing : zip-3.0-39.fc39.aarch64 58/154 Installing : zstd-1.5.6-1.fc39.aarch64 59/154 Installing : libfdisk-2.39.4-1.fc39.aarch64 60/154 Installing : bzip2-1.0.8-16.fc39.aarch64 61/154 Installing : libxml2-2.10.4-3.fc39.aarch64 62/154 Installing : sqlite-libs-3.42.0-7.fc39.aarch64 63/154 Installing : ed-1.19-4.fc39.aarch64 64/154 Installing : elfutils-default-yama-scope-0.192-4.fc39.noarch 65/154 Running scriptlet: elfutils-default-yama-scope-0.192-4.fc39.noarch 65/154 Installing : cpio-2.14-4.fc39.aarch64 66/154 Installing : diffutils-3.10-3.fc39.aarch64 67/154 Installing : gdbm-libs-1:1.23-4.fc39.aarch64 68/154 Installing : cyrus-sasl-lib-2.1.28-11.fc39.aarch64 69/154 Installing : json-c-0.17-1.fc39.aarch64 70/154 Installing : libbrotli-1.1.0-1.fc39.aarch64 71/154 Installing : libdb-5.3.28-56.fc39.aarch64 72/154 Installing : libffi-3.4.4-4.fc39.aarch64 73/154 Installing : p11-kit-0.25.5-1.fc39.aarch64 74/154 Installing : p11-kit-trust-0.25.5-1.fc39.aarch64 75/154 Running scriptlet: p11-kit-trust-0.25.5-1.fc39.aarch64 75/154 Installing : libpkgconf-1.9.5-2.fc39.aarch64 76/154 Installing : pkgconf-1.9.5-2.fc39.aarch64 77/154 Installing : libsigsegv-2.14-5.fc39.aarch64 78/154 Installing : gawk-5.2.2-2.fc39.aarch64 79/154 Installing : libtool-ltdl-2.4.7-7.fc39.aarch64 80/154 Installing : libverto-0.3.2-6.fc39.aarch64 81/154 Installing : keyutils-libs-1.6.3-1.fc39.aarch64 82/154 Installing : libgomp-13.3.1-3.fc39.aarch64 83/154 Installing : libnghttp2-1.55.1-5.fc39.aarch64 84/154 Installing : xxhash-libs-0.8.2-4.fc39.aarch64 85/154 Installing : libssh-config-0.10.6-2.fc39.noarch 86/154 Installing : coreutils-common-9.3-7.fc39.aarch64 87/154 Installing : ansible-srpm-macros-1-12.fc39.noarch 88/154 Installing : pkgconf-m4-1.9.5-2.fc39.noarch 89/154 Installing : pkgconf-pkg-config-1.9.5-2.fc39.aarch64 90/154 Installing : perl-srpm-macros-1-51.fc39.noarch 91/154 Installing : pcre2-syntax-10.42-1.fc39.2.noarch 92/154 Installing : pcre2-10.42-1.fc39.2.aarch64 93/154 Installing : libselinux-3.5-5.fc39.aarch64 94/154 Installing : sed-4.8-14.fc39.aarch64 95/154 Installing : grep-3.11-3.fc39.aarch64 96/154 Installing : findutils-1:4.9.0-6.fc39.aarch64 97/154 Installing : xz-5.4.4-1.fc39.aarch64 98/154 Installing : libmount-2.39.4-1.fc39.aarch64 99/154 Installing : util-linux-core-2.39.4-1.fc39.aarch64 100/154 Installing : openssl-libs-1:3.1.4-4.fc39.aarch64 101/154 Installing : coreutils-9.3-7.fc39.aarch64 102/154 Running scriptlet: ca-certificates-2024.2.69_v8.0.401-1.0.fc39.noar 103/154 Installing : ca-certificates-2024.2.69_v8.0.401-1.0.fc39.noar 103/154 Running scriptlet: ca-certificates-2024.2.69_v8.0.401-1.0.fc39.noar 103/154 Installing : krb5-libs-1.21.3-2.fc39.aarch64 104/154 Installing : libtirpc-1.3.6-0.fc39.aarch64 105/154 Running scriptlet: authselect-libs-1.4.3-1.fc39.aarch64 106/154 Installing : authselect-libs-1.4.3-1.fc39.aarch64 106/154 Installing : gzip-1.12-6.fc39.aarch64 107/154 Installing : libarchive-3.7.1-3.fc39.aarch64 108/154 Installing : cracklib-2.9.11-2.fc39.aarch64 109/154 Installing : libpwquality-1.4.5-6.fc39.aarch64 110/154 Installing : authselect-1.4.3-1.fc39.aarch64 111/154 Installing : libnsl2-2.0.0-6.fc39.aarch64 112/154 Installing : pam-1.5.3-3.fc39.aarch64 113/154 Installing : libssh-0.10.6-2.fc39.aarch64 114/154 Installing : libevent-2.1.12-9.fc39.aarch64 115/154 Installing : openldap-2.6.7-1.fc39.aarch64 116/154 Installing : libcurl-8.2.1-5.fc39.aarch64 117/154 Installing : elfutils-libs-0.192-4.fc39.aarch64 118/154 Installing : elfutils-debuginfod-client-0.192-4.fc39.aarch64 119/154 Installing : binutils-gold-2.40-14.fc39.aarch64 120/154 Running scriptlet: binutils-gold-2.40-14.fc39.aarch64 120/154 Installing : binutils-2.40-14.fc39.aarch64 121/154 Running scriptlet: binutils-2.40-14.fc39.aarch64 121/154 Installing : elfutils-0.192-4.fc39.aarch64 122/154 Installing : gdb-minimal-15.1-1.fc39.aarch64 123/154 Installing : debugedit-5.0-12.fc39.aarch64 124/154 Installing : curl-8.2.1-5.fc39.aarch64 125/154 Installing : rpm-sequoia-1.7.0-1.fc39.aarch64 126/154 Installing : rpm-libs-4.19.1.1-1.fc39.aarch64 127/154 Running scriptlet: rpm-4.19.1.1-1.fc39.aarch64 128/154 Installing : rpm-4.19.1.1-1.fc39.aarch64 128/154 Installing : efi-srpm-macros-5-9.fc39.noarch 129/154 Installing : lua-srpm-macros-1-13.fc39.noarch 130/154 Installing : rpmautospec-rpm-macros-0.7.3-1.fc39.noarch 131/154 Installing : rpm-build-libs-4.19.1.1-1.fc39.aarch64 132/154 Installing : libsemanage-3.5-4.fc39.aarch64 133/154 Installing : shadow-utils-2:4.14.0-2.fc39.aarch64 134/154 Running scriptlet: libutempter-1.2.1-10.fc39.aarch64 135/154 Installing : libutempter-1.2.1-10.fc39.aarch64 135/154 Installing : patch-2.7.6-22.fc39.aarch64 136/154 Installing : tar-2:1.35-2.fc39.aarch64 137/154 Installing : package-notes-srpm-macros-0.5-9.fc39.noarch 138/154 Installing : openblas-srpm-macros-2-14.fc39.noarch 139/154 Installing : ocaml-srpm-macros-8-2.fc39.noarch 140/154 Installing : kernel-srpm-macros-1.0-20.fc39.noarch 141/154 Installing : gnat-srpm-macros-6-3.fc39.noarch 142/154 Installing : ghc-srpm-macros-1.6.1-2.fc39.noarch 143/154 Installing : fpc-srpm-macros-1.3-8.fc39.noarch 144/154 Installing : fonts-srpm-macros-1:2.0.5-12.fc39.noarch 145/154 Installing : forge-srpm-macros-0.3.1-1.fc39.noarch 146/154 Installing : go-srpm-macros-3.5.0-1.fc39.noarch 147/154 Installing : python-srpm-macros-3.12-8.fc39.noarch 148/154 Installing : redhat-rpm-config-266-1.fc39.noarch 149/154 Installing : rpm-build-4.19.1.1-1.fc39.aarch64 150/154 Installing : pyproject-srpm-macros-1.16.0-1.fc39.noarch 151/154 Installing : util-linux-2.39.4-1.fc39.aarch64 152/154 Running scriptlet: util-linux-2.39.4-1.fc39.aarch64 152/154 Installing : which-2.21-40.fc39.aarch64 153/154 Installing : info-7.0.3-3.fc39.aarch64 154/154 Running scriptlet: filesystem-3.18-6.fc39.aarch64 154/154 Running scriptlet: ca-certificates-2024.2.69_v8.0.401-1.0.fc39.noar 154/154 Running scriptlet: authselect-libs-1.4.3-1.fc39.aarch64 154/154 Running scriptlet: rpm-4.19.1.1-1.fc39.aarch64 154/154 Running scriptlet: info-7.0.3-3.fc39.aarch64 154/154 Verifying : glibc-2.38-99.fc39.aarch64 1/154 Verifying : glibc-common-2.38-99.fc39.aarch64 2/154 Verifying : glibc-gconv-extra-2.38-99.fc39.aarch64 3/154 Verifying : glibc-minimal-langpack-2.38-99.fc39.aarch64 4/154 Verifying : authselect-1.4.3-1.fc39.aarch64 5/154 Verifying : authselect-libs-1.4.3-1.fc39.aarch64 6/154 Verifying : basesystem-11-18.fc39.noarch 7/154 Verifying : bzip2-1.0.8-16.fc39.aarch64 8/154 Verifying : bzip2-libs-1.0.8-16.fc39.aarch64 9/154 Verifying : cpio-2.14-4.fc39.aarch64 10/154 Verifying : cracklib-2.9.11-2.fc39.aarch64 11/154 Verifying : cyrus-sasl-lib-2.1.28-11.fc39.aarch64 12/154 Verifying : diffutils-3.10-3.fc39.aarch64 13/154 Verifying : dwz-0.15-3.fc39.aarch64 14/154 Verifying : ed-1.19-4.fc39.aarch64 15/154 Verifying : efi-srpm-macros-5-9.fc39.noarch 16/154 Verifying : file-5.44-5.fc39.aarch64 17/154 Verifying : file-libs-5.44-5.fc39.aarch64 18/154 Verifying : filesystem-3.18-6.fc39.aarch64 19/154 Verifying : fonts-srpm-macros-1:2.0.5-12.fc39.noarch 20/154 Verifying : fpc-srpm-macros-1.3-8.fc39.noarch 21/154 Verifying : gawk-5.2.2-2.fc39.aarch64 22/154 Verifying : gdbm-libs-1:1.23-4.fc39.aarch64 23/154 Verifying : ghc-srpm-macros-1.6.1-2.fc39.noarch 24/154 Verifying : gmp-1:6.2.1-5.fc39.aarch64 25/154 Verifying : gnat-srpm-macros-6-3.fc39.noarch 26/154 Verifying : grep-3.11-3.fc39.aarch64 27/154 Verifying : gzip-1.12-6.fc39.aarch64 28/154 Verifying : info-7.0.3-3.fc39.aarch64 29/154 Verifying : jansson-2.13.1-7.fc39.aarch64 30/154 Verifying : json-c-0.17-1.fc39.aarch64 31/154 Verifying : kernel-srpm-macros-1.0-20.fc39.noarch 32/154 Verifying : libattr-2.5.1-8.fc39.aarch64 33/154 Verifying : libbrotli-1.1.0-1.fc39.aarch64 34/154 Verifying : libcap-ng-0.8.3-8.fc39.aarch64 35/154 Verifying : libcom_err-1.47.0-2.fc39.aarch64 36/154 Verifying : libdb-5.3.28-56.fc39.aarch64 37/154 Verifying : libevent-2.1.12-9.fc39.aarch64 38/154 Verifying : libffi-3.4.4-4.fc39.aarch64 39/154 Verifying : libnsl2-2.0.0-6.fc39.aarch64 40/154 Verifying : libpkgconf-1.9.5-2.fc39.aarch64 41/154 Verifying : libpsl-0.21.2-4.fc39.aarch64 42/154 Verifying : libpwquality-1.4.5-6.fc39.aarch64 43/154 Verifying : libselinux-3.5-5.fc39.aarch64 44/154 Verifying : libsemanage-3.5-4.fc39.aarch64 45/154 Verifying : libsepol-3.5-2.fc39.aarch64 46/154 Verifying : libsigsegv-2.14-5.fc39.aarch64 47/154 Verifying : libtasn1-4.19.0-3.fc39.aarch64 48/154 Verifying : libtool-ltdl-2.4.7-7.fc39.aarch64 49/154 Verifying : libunistring-1.1-5.fc39.aarch64 50/154 Verifying : libutempter-1.2.1-10.fc39.aarch64 51/154 Verifying : libverto-0.3.2-6.fc39.aarch64 52/154 Verifying : libxcrypt-4.4.36-2.fc39.aarch64 53/154 Verifying : libxml2-2.10.4-3.fc39.aarch64 54/154 Verifying : lua-libs-5.4.6-3.fc39.aarch64 55/154 Verifying : lz4-libs-1.9.4-4.fc39.aarch64 56/154 Verifying : mpfr-4.2.0-3.fc39.aarch64 57/154 Verifying : ocaml-srpm-macros-8-2.fc39.noarch 58/154 Verifying : openblas-srpm-macros-2-14.fc39.noarch 59/154 Verifying : package-notes-srpm-macros-0.5-9.fc39.noarch 60/154 Verifying : patch-2.7.6-22.fc39.aarch64 61/154 Verifying : pcre2-10.42-1.fc39.2.aarch64 62/154 Verifying : pcre2-syntax-10.42-1.fc39.2.noarch 63/154 Verifying : perl-srpm-macros-1-51.fc39.noarch 64/154 Verifying : pkgconf-1.9.5-2.fc39.aarch64 65/154 Verifying : pkgconf-m4-1.9.5-2.fc39.noarch 66/154 Verifying : pkgconf-pkg-config-1.9.5-2.fc39.aarch64 67/154 Verifying : popt-1.19-3.fc39.aarch64 68/154 Verifying : sed-4.8-14.fc39.aarch64 69/154 Verifying : setup-2.14.4-1.fc39.noarch 70/154 Verifying : sqlite-libs-3.42.0-7.fc39.aarch64 71/154 Verifying : tar-2:1.35-2.fc39.aarch64 72/154 Verifying : unzip-6.0-62.fc39.aarch64 73/154 Verifying : which-2.21-40.fc39.aarch64 74/154 Verifying : xz-5.4.4-1.fc39.aarch64 75/154 Verifying : xz-libs-5.4.4-1.fc39.aarch64 76/154 Verifying : zip-3.0-39.fc39.aarch64 77/154 Verifying : zlib-1.2.13-4.fc39.aarch64 78/154 Verifying : alternatives-1.26-1.fc39.aarch64 79/154 Verifying : ansible-srpm-macros-1-12.fc39.noarch 80/154 Verifying : audit-libs-3.1.5-1.fc39.aarch64 81/154 Verifying : bash-5.2.26-1.fc39.aarch64 82/154 Verifying : binutils-2.40-14.fc39.aarch64 83/154 Verifying : binutils-gold-2.40-14.fc39.aarch64 84/154 Verifying : ca-certificates-2024.2.69_v8.0.401-1.0.fc39.noar 85/154 Verifying : coreutils-9.3-7.fc39.aarch64 86/154 Verifying : coreutils-common-9.3-7.fc39.aarch64 87/154 Verifying : crypto-policies-20231204-1.git1e3a2e4.fc39.noarc 88/154 Verifying : curl-8.2.1-5.fc39.aarch64 89/154 Verifying : debugedit-5.0-12.fc39.aarch64 90/154 Verifying : elfutils-0.192-4.fc39.aarch64 91/154 Verifying : elfutils-debuginfod-client-0.192-4.fc39.aarch64 92/154 Verifying : elfutils-default-yama-scope-0.192-4.fc39.noarch 93/154 Verifying : elfutils-libelf-0.192-4.fc39.aarch64 94/154 Verifying : elfutils-libs-0.192-4.fc39.aarch64 95/154 Verifying : fedora-gpg-keys-39-2.noarch 96/154 Verifying : fedora-release-39-36.noarch 97/154 Verifying : fedora-release-common-39-36.noarch 98/154 Verifying : fedora-release-identity-basic-39-36.noarch 99/154 Verifying : fedora-repos-39-2.noarch 100/154 Verifying : findutils-1:4.9.0-6.fc39.aarch64 101/154 Verifying : forge-srpm-macros-0.3.1-1.fc39.noarch 102/154 Verifying : gdb-minimal-15.1-1.fc39.aarch64 103/154 Verifying : go-srpm-macros-3.5.0-1.fc39.noarch 104/154 Verifying : keyutils-libs-1.6.3-1.fc39.aarch64 105/154 Verifying : krb5-libs-1.21.3-2.fc39.aarch64 106/154 Verifying : libacl-2.3.1-9.fc39.aarch64 107/154 Verifying : libarchive-3.7.1-3.fc39.aarch64 108/154 Verifying : libblkid-2.39.4-1.fc39.aarch64 109/154 Verifying : libcap-2.48-9.fc39.aarch64 110/154 Verifying : libcurl-8.2.1-5.fc39.aarch64 111/154 Verifying : libeconf-0.5.2-2.fc39.aarch64 112/154 Verifying : libfdisk-2.39.4-1.fc39.aarch64 113/154 Verifying : libgcc-13.3.1-3.fc39.aarch64 114/154 Verifying : libgomp-13.3.1-3.fc39.aarch64 115/154 Verifying : libidn2-2.3.7-1.fc39.aarch64 116/154 Verifying : libmount-2.39.4-1.fc39.aarch64 117/154 Verifying : libnghttp2-1.55.1-5.fc39.aarch64 118/154 Verifying : libsmartcols-2.39.4-1.fc39.aarch64 119/154 Verifying : libssh-0.10.6-2.fc39.aarch64 120/154 Verifying : libssh-config-0.10.6-2.fc39.noarch 121/154 Verifying : libstdc++-13.3.1-3.fc39.aarch64 122/154 Verifying : libtirpc-1.3.6-0.fc39.aarch64 123/154 Verifying : libuuid-2.39.4-1.fc39.aarch64 124/154 Verifying : libzstd-1.5.6-1.fc39.aarch64 125/154 Verifying : lua-srpm-macros-1-13.fc39.noarch 126/154 Verifying : ncurses-base-6.4-7.20230520.fc39.1.noarch 127/154 Verifying : ncurses-libs-6.4-7.20230520.fc39.1.aarch64 128/154 Verifying : openldap-2.6.7-1.fc39.aarch64 129/154 Verifying : openssl-libs-1:3.1.4-4.fc39.aarch64 130/154 Verifying : p11-kit-0.25.5-1.fc39.aarch64 131/154 Verifying : p11-kit-trust-0.25.5-1.fc39.aarch64 132/154 Verifying : pam-1.5.3-3.fc39.aarch64 133/154 Verifying : pam-libs-1.5.3-3.fc39.aarch64 134/154 Verifying : publicsuffix-list-dafsa-20240107-1.fc39.noarch 135/154 Verifying : pyproject-srpm-macros-1.16.0-1.fc39.noarch 136/154 Verifying : python-srpm-macros-3.12-8.fc39.noarch 137/154 Verifying : qt5-srpm-macros-5.15.14-2.fc39.noarch 138/154 Verifying : qt6-srpm-macros-6.6.2-1.fc39.noarch 139/154 Verifying : readline-8.2-6.fc39.aarch64 140/154 Verifying : redhat-rpm-config-266-1.fc39.noarch 141/154 Verifying : rpm-4.19.1.1-1.fc39.aarch64 142/154 Verifying : rpm-build-4.19.1.1-1.fc39.aarch64 143/154 Verifying : rpm-build-libs-4.19.1.1-1.fc39.aarch64 144/154 Verifying : rpm-libs-4.19.1.1-1.fc39.aarch64 145/154 Verifying : rpm-sequoia-1.7.0-1.fc39.aarch64 146/154 Verifying : rpmautospec-rpm-macros-0.7.3-1.fc39.noarch 147/154 Verifying : rust-srpm-macros-26.3-1.fc39.noarch 148/154 Verifying : shadow-utils-2:4.14.0-2.fc39.aarch64 149/154 Verifying : systemd-libs-254.20-1.fc39.aarch64 150/154 Verifying : util-linux-2.39.4-1.fc39.aarch64 151/154 Verifying : util-linux-core-2.39.4-1.fc39.aarch64 152/154 Verifying : xxhash-libs-0.8.2-4.fc39.aarch64 153/154 Verifying : zstd-1.5.6-1.fc39.aarch64 154/154 Installed: alternatives-1.26-1.fc39.aarch64 ansible-srpm-macros-1-12.fc39.noarch audit-libs-3.1.5-1.fc39.aarch64 authselect-1.4.3-1.fc39.aarch64 authselect-libs-1.4.3-1.fc39.aarch64 basesystem-11-18.fc39.noarch bash-5.2.26-1.fc39.aarch64 binutils-2.40-14.fc39.aarch64 binutils-gold-2.40-14.fc39.aarch64 bzip2-1.0.8-16.fc39.aarch64 bzip2-libs-1.0.8-16.fc39.aarch64 ca-certificates-2024.2.69_v8.0.401-1.0.fc39.noarch coreutils-9.3-7.fc39.aarch64 coreutils-common-9.3-7.fc39.aarch64 cpio-2.14-4.fc39.aarch64 cracklib-2.9.11-2.fc39.aarch64 crypto-policies-20231204-1.git1e3a2e4.fc39.noarch curl-8.2.1-5.fc39.aarch64 cyrus-sasl-lib-2.1.28-11.fc39.aarch64 debugedit-5.0-12.fc39.aarch64 diffutils-3.10-3.fc39.aarch64 dwz-0.15-3.fc39.aarch64 ed-1.19-4.fc39.aarch64 efi-srpm-macros-5-9.fc39.noarch elfutils-0.192-4.fc39.aarch64 elfutils-debuginfod-client-0.192-4.fc39.aarch64 elfutils-default-yama-scope-0.192-4.fc39.noarch elfutils-libelf-0.192-4.fc39.aarch64 elfutils-libs-0.192-4.fc39.aarch64 fedora-gpg-keys-39-2.noarch fedora-release-39-36.noarch fedora-release-common-39-36.noarch fedora-release-identity-basic-39-36.noarch fedora-repos-39-2.noarch file-5.44-5.fc39.aarch64 file-libs-5.44-5.fc39.aarch64 filesystem-3.18-6.fc39.aarch64 findutils-1:4.9.0-6.fc39.aarch64 fonts-srpm-macros-1:2.0.5-12.fc39.noarch forge-srpm-macros-0.3.1-1.fc39.noarch fpc-srpm-macros-1.3-8.fc39.noarch gawk-5.2.2-2.fc39.aarch64 gdb-minimal-15.1-1.fc39.aarch64 gdbm-libs-1:1.23-4.fc39.aarch64 ghc-srpm-macros-1.6.1-2.fc39.noarch glibc-2.38-99.fc39.aarch64 glibc-common-2.38-99.fc39.aarch64 glibc-gconv-extra-2.38-99.fc39.aarch64 glibc-minimal-langpack-2.38-99.fc39.aarch64 gmp-1:6.2.1-5.fc39.aarch64 gnat-srpm-macros-6-3.fc39.noarch go-srpm-macros-3.5.0-1.fc39.noarch grep-3.11-3.fc39.aarch64 gzip-1.12-6.fc39.aarch64 info-7.0.3-3.fc39.aarch64 jansson-2.13.1-7.fc39.aarch64 json-c-0.17-1.fc39.aarch64 kernel-srpm-macros-1.0-20.fc39.noarch keyutils-libs-1.6.3-1.fc39.aarch64 krb5-libs-1.21.3-2.fc39.aarch64 libacl-2.3.1-9.fc39.aarch64 libarchive-3.7.1-3.fc39.aarch64 libattr-2.5.1-8.fc39.aarch64 libblkid-2.39.4-1.fc39.aarch64 libbrotli-1.1.0-1.fc39.aarch64 libcap-2.48-9.fc39.aarch64 libcap-ng-0.8.3-8.fc39.aarch64 libcom_err-1.47.0-2.fc39.aarch64 libcurl-8.2.1-5.fc39.aarch64 libdb-5.3.28-56.fc39.aarch64 libeconf-0.5.2-2.fc39.aarch64 libevent-2.1.12-9.fc39.aarch64 libfdisk-2.39.4-1.fc39.aarch64 libffi-3.4.4-4.fc39.aarch64 libgcc-13.3.1-3.fc39.aarch64 libgomp-13.3.1-3.fc39.aarch64 libidn2-2.3.7-1.fc39.aarch64 libmount-2.39.4-1.fc39.aarch64 libnghttp2-1.55.1-5.fc39.aarch64 libnsl2-2.0.0-6.fc39.aarch64 libpkgconf-1.9.5-2.fc39.aarch64 libpsl-0.21.2-4.fc39.aarch64 libpwquality-1.4.5-6.fc39.aarch64 libselinux-3.5-5.fc39.aarch64 libsemanage-3.5-4.fc39.aarch64 libsepol-3.5-2.fc39.aarch64 libsigsegv-2.14-5.fc39.aarch64 libsmartcols-2.39.4-1.fc39.aarch64 libssh-0.10.6-2.fc39.aarch64 libssh-config-0.10.6-2.fc39.noarch libstdc++-13.3.1-3.fc39.aarch64 libtasn1-4.19.0-3.fc39.aarch64 libtirpc-1.3.6-0.fc39.aarch64 libtool-ltdl-2.4.7-7.fc39.aarch64 libunistring-1.1-5.fc39.aarch64 libutempter-1.2.1-10.fc39.aarch64 libuuid-2.39.4-1.fc39.aarch64 libverto-0.3.2-6.fc39.aarch64 libxcrypt-4.4.36-2.fc39.aarch64 libxml2-2.10.4-3.fc39.aarch64 libzstd-1.5.6-1.fc39.aarch64 lua-libs-5.4.6-3.fc39.aarch64 lua-srpm-macros-1-13.fc39.noarch lz4-libs-1.9.4-4.fc39.aarch64 mpfr-4.2.0-3.fc39.aarch64 ncurses-base-6.4-7.20230520.fc39.1.noarch ncurses-libs-6.4-7.20230520.fc39.1.aarch64 ocaml-srpm-macros-8-2.fc39.noarch openblas-srpm-macros-2-14.fc39.noarch openldap-2.6.7-1.fc39.aarch64 openssl-libs-1:3.1.4-4.fc39.aarch64 p11-kit-0.25.5-1.fc39.aarch64 p11-kit-trust-0.25.5-1.fc39.aarch64 package-notes-srpm-macros-0.5-9.fc39.noarch pam-1.5.3-3.fc39.aarch64 pam-libs-1.5.3-3.fc39.aarch64 patch-2.7.6-22.fc39.aarch64 pcre2-10.42-1.fc39.2.aarch64 pcre2-syntax-10.42-1.fc39.2.noarch perl-srpm-macros-1-51.fc39.noarch pkgconf-1.9.5-2.fc39.aarch64 pkgconf-m4-1.9.5-2.fc39.noarch pkgconf-pkg-config-1.9.5-2.fc39.aarch64 popt-1.19-3.fc39.aarch64 publicsuffix-list-dafsa-20240107-1.fc39.noarch pyproject-srpm-macros-1.16.0-1.fc39.noarch python-srpm-macros-3.12-8.fc39.noarch qt5-srpm-macros-5.15.14-2.fc39.noarch qt6-srpm-macros-6.6.2-1.fc39.noarch readline-8.2-6.fc39.aarch64 redhat-rpm-config-266-1.fc39.noarch rpm-4.19.1.1-1.fc39.aarch64 rpm-build-4.19.1.1-1.fc39.aarch64 rpm-build-libs-4.19.1.1-1.fc39.aarch64 rpm-libs-4.19.1.1-1.fc39.aarch64 rpm-sequoia-1.7.0-1.fc39.aarch64 rpmautospec-rpm-macros-0.7.3-1.fc39.noarch rust-srpm-macros-26.3-1.fc39.noarch sed-4.8-14.fc39.aarch64 setup-2.14.4-1.fc39.noarch shadow-utils-2:4.14.0-2.fc39.aarch64 sqlite-libs-3.42.0-7.fc39.aarch64 systemd-libs-254.20-1.fc39.aarch64 tar-2:1.35-2.fc39.aarch64 unzip-6.0-62.fc39.aarch64 util-linux-2.39.4-1.fc39.aarch64 util-linux-core-2.39.4-1.fc39.aarch64 which-2.21-40.fc39.aarch64 xxhash-libs-0.8.2-4.fc39.aarch64 xz-5.4.4-1.fc39.aarch64 xz-libs-5.4.4-1.fc39.aarch64 zip-3.0-39.fc39.aarch64 zlib-1.2.13-4.fc39.aarch64 zstd-1.5.6-1.fc39.aarch64 Complete! Finish: installing minimal buildroot with dnf Start: creating root cache Finish: creating root cache Finish: chroot init INFO: Installed packages: INFO: alternatives-1.26-1.fc39.aarch64 ansible-srpm-macros-1-12.fc39.noarch audit-libs-3.1.5-1.fc39.aarch64 authselect-1.4.3-1.fc39.aarch64 authselect-libs-1.4.3-1.fc39.aarch64 basesystem-11-18.fc39.noarch bash-5.2.26-1.fc39.aarch64 binutils-2.40-14.fc39.aarch64 binutils-gold-2.40-14.fc39.aarch64 bzip2-1.0.8-16.fc39.aarch64 bzip2-libs-1.0.8-16.fc39.aarch64 ca-certificates-2024.2.69_v8.0.401-1.0.fc39.noarch coreutils-9.3-7.fc39.aarch64 coreutils-common-9.3-7.fc39.aarch64 cpio-2.14-4.fc39.aarch64 cracklib-2.9.11-2.fc39.aarch64 crypto-policies-20231204-1.git1e3a2e4.fc39.noarch curl-8.2.1-5.fc39.aarch64 cyrus-sasl-lib-2.1.28-11.fc39.aarch64 debugedit-5.0-12.fc39.aarch64 diffutils-3.10-3.fc39.aarch64 dwz-0.15-3.fc39.aarch64 ed-1.19-4.fc39.aarch64 efi-srpm-macros-5-9.fc39.noarch elfutils-0.192-4.fc39.aarch64 elfutils-debuginfod-client-0.192-4.fc39.aarch64 elfutils-default-yama-scope-0.192-4.fc39.noarch elfutils-libelf-0.192-4.fc39.aarch64 elfutils-libs-0.192-4.fc39.aarch64 fedora-gpg-keys-39-2.noarch fedora-release-39-36.noarch fedora-release-common-39-36.noarch fedora-release-identity-basic-39-36.noarch fedora-repos-39-2.noarch file-5.44-5.fc39.aarch64 file-libs-5.44-5.fc39.aarch64 filesystem-3.18-6.fc39.aarch64 findutils-4.9.0-6.fc39.aarch64 fonts-srpm-macros-2.0.5-12.fc39.noarch forge-srpm-macros-0.3.1-1.fc39.noarch fpc-srpm-macros-1.3-8.fc39.noarch gawk-5.2.2-2.fc39.aarch64 gdb-minimal-15.1-1.fc39.aarch64 gdbm-libs-1.23-4.fc39.aarch64 ghc-srpm-macros-1.6.1-2.fc39.noarch glibc-2.38-99.fc39.aarch64 glibc-common-2.38-99.fc39.aarch64 glibc-gconv-extra-2.38-99.fc39.aarch64 glibc-minimal-langpack-2.38-99.fc39.aarch64 gmp-6.2.1-5.fc39.aarch64 gnat-srpm-macros-6-3.fc39.noarch go-srpm-macros-3.5.0-1.fc39.noarch gpg-pubkey-18b8e74c-62f2920f grep-3.11-3.fc39.aarch64 gzip-1.12-6.fc39.aarch64 info-7.0.3-3.fc39.aarch64 jansson-2.13.1-7.fc39.aarch64 json-c-0.17-1.fc39.aarch64 kernel-srpm-macros-1.0-20.fc39.noarch keyutils-libs-1.6.3-1.fc39.aarch64 krb5-libs-1.21.3-2.fc39.aarch64 libacl-2.3.1-9.fc39.aarch64 libarchive-3.7.1-3.fc39.aarch64 libattr-2.5.1-8.fc39.aarch64 libblkid-2.39.4-1.fc39.aarch64 libbrotli-1.1.0-1.fc39.aarch64 libcap-2.48-9.fc39.aarch64 libcap-ng-0.8.3-8.fc39.aarch64 libcom_err-1.47.0-2.fc39.aarch64 libcurl-8.2.1-5.fc39.aarch64 libdb-5.3.28-56.fc39.aarch64 libeconf-0.5.2-2.fc39.aarch64 libevent-2.1.12-9.fc39.aarch64 libfdisk-2.39.4-1.fc39.aarch64 libffi-3.4.4-4.fc39.aarch64 libgcc-13.3.1-3.fc39.aarch64 libgomp-13.3.1-3.fc39.aarch64 libidn2-2.3.7-1.fc39.aarch64 libmount-2.39.4-1.fc39.aarch64 libnghttp2-1.55.1-5.fc39.aarch64 libnsl2-2.0.0-6.fc39.aarch64 libpkgconf-1.9.5-2.fc39.aarch64 libpsl-0.21.2-4.fc39.aarch64 libpwquality-1.4.5-6.fc39.aarch64 libselinux-3.5-5.fc39.aarch64 libsemanage-3.5-4.fc39.aarch64 libsepol-3.5-2.fc39.aarch64 libsigsegv-2.14-5.fc39.aarch64 libsmartcols-2.39.4-1.fc39.aarch64 libssh-0.10.6-2.fc39.aarch64 libssh-config-0.10.6-2.fc39.noarch libstdc++-13.3.1-3.fc39.aarch64 libtasn1-4.19.0-3.fc39.aarch64 libtirpc-1.3.6-0.fc39.aarch64 libtool-ltdl-2.4.7-7.fc39.aarch64 libunistring-1.1-5.fc39.aarch64 libutempter-1.2.1-10.fc39.aarch64 libuuid-2.39.4-1.fc39.aarch64 libverto-0.3.2-6.fc39.aarch64 libxcrypt-4.4.36-2.fc39.aarch64 libxml2-2.10.4-3.fc39.aarch64 libzstd-1.5.6-1.fc39.aarch64 lua-libs-5.4.6-3.fc39.aarch64 lua-srpm-macros-1-13.fc39.noarch lz4-libs-1.9.4-4.fc39.aarch64 mpfr-4.2.0-3.fc39.aarch64 ncurses-base-6.4-7.20230520.fc39.1.noarch ncurses-libs-6.4-7.20230520.fc39.1.aarch64 ocaml-srpm-macros-8-2.fc39.noarch openblas-srpm-macros-2-14.fc39.noarch openldap-2.6.7-1.fc39.aarch64 openssl-libs-3.1.4-4.fc39.aarch64 p11-kit-0.25.5-1.fc39.aarch64 p11-kit-trust-0.25.5-1.fc39.aarch64 package-notes-srpm-macros-0.5-9.fc39.noarch pam-1.5.3-3.fc39.aarch64 pam-libs-1.5.3-3.fc39.aarch64 patch-2.7.6-22.fc39.aarch64 pcre2-10.42-1.fc39.2.aarch64 pcre2-syntax-10.42-1.fc39.2.noarch perl-srpm-macros-1-51.fc39.noarch pkgconf-1.9.5-2.fc39.aarch64 pkgconf-m4-1.9.5-2.fc39.noarch pkgconf-pkg-config-1.9.5-2.fc39.aarch64 popt-1.19-3.fc39.aarch64 publicsuffix-list-dafsa-20240107-1.fc39.noarch pyproject-srpm-macros-1.16.0-1.fc39.noarch python-srpm-macros-3.12-8.fc39.noarch qt5-srpm-macros-5.15.14-2.fc39.noarch qt6-srpm-macros-6.6.2-1.fc39.noarch readline-8.2-6.fc39.aarch64 redhat-rpm-config-266-1.fc39.noarch rpm-4.19.1.1-1.fc39.aarch64 rpm-build-4.19.1.1-1.fc39.aarch64 rpm-build-libs-4.19.1.1-1.fc39.aarch64 rpm-libs-4.19.1.1-1.fc39.aarch64 rpm-sequoia-1.7.0-1.fc39.aarch64 rpmautospec-rpm-macros-0.7.3-1.fc39.noarch rust-srpm-macros-26.3-1.fc39.noarch sed-4.8-14.fc39.aarch64 setup-2.14.4-1.fc39.noarch shadow-utils-4.14.0-2.fc39.aarch64 sqlite-libs-3.42.0-7.fc39.aarch64 systemd-libs-254.20-1.fc39.aarch64 tar-1.35-2.fc39.aarch64 unzip-6.0-62.fc39.aarch64 util-linux-2.39.4-1.fc39.aarch64 util-linux-core-2.39.4-1.fc39.aarch64 which-2.21-40.fc39.aarch64 xxhash-libs-0.8.2-4.fc39.aarch64 xz-5.4.4-1.fc39.aarch64 xz-libs-5.4.4-1.fc39.aarch64 zip-3.0-39.fc39.aarch64 zlib-1.2.13-4.fc39.aarch64 zstd-1.5.6-1.fc39.aarch64 Start: buildsrpm Start: rpmbuild -bs sh: -c: line 1: unexpected EOF while looking for matching `"' Building target platforms: aarch64 Building for target aarch64 setting SOURCE_DATE_EPOCH=1636416000 Wrote: /builddir/build/SRPMS/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.src.rpm Finish: rpmbuild -bs INFO: chroot_scan: 3 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/fedora-39-aarch64-1732846793.454290/root/var/log/dnf.log /var/lib/mock/fedora-39-aarch64-1732846793.454290/root/var/log/dnf.librepo.log /var/lib/mock/fedora-39-aarch64-1732846793.454290/root/var/log/dnf.rpm.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names Finish: buildsrpm INFO: Done(/var/lib/copr-rpmbuild/workspace/workdir-gtryy35o/cutlass/cutlass.spec) Config(child) 1 minutes 16 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot INFO: Start(/var/lib/copr-rpmbuild/results/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.src.rpm) Config(fedora-39-aarch64) Start(bootstrap): chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-39-aarch64-bootstrap-1732846793.454290/root. INFO: reusing tmpfs at /var/lib/mock/fedora-39-aarch64-bootstrap-1732846793.454290/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start(bootstrap): cleaning package manager metadata Finish(bootstrap): cleaning package manager metadata Finish(bootstrap): chroot init Start: chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-39-aarch64-1732846793.454290/root. INFO: calling preinit hooks INFO: enabled root cache Start: unpacking root cache Finish: unpacking root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Buildroot is handled by package management downloaded with a bootstrap image: rpm-4.19.1.1-1.fc39.aarch64 rpm-sequoia-1.7.0-1.fc39.aarch64 python3-dnf-4.21.1-1.fc39.noarch python3-dnf-plugins-core-4.9.0-1.fc39.noarch yum-4.21.1-1.fc39.noarch Finish: chroot init Start: build phase for cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.src.rpm Start: build setup for cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.src.rpm sh: -c: line 1: unexpected EOF while looking for matching `"' Building target platforms: aarch64 Building for target aarch64 setting SOURCE_DATE_EPOCH=1636416000 Wrote: /builddir/build/SRPMS/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.src.rpm No matches found for the following disable plugin patterns: local, spacewalk, versionlock Copr repository 66 kB/s | 1.5 kB 00:00 Additional repo copr_rezso_CUDA 79 kB/s | 1.5 kB 00:00 Additional repo http_developer_download_nvidia_ 1.0 MB/s | 3.5 kB 00:00 Additional repo http_developer_download_nvidia_ 1.1 MB/s | 3.5 kB 00:00 fedora 569 kB/s | 16 kB 00:00 updates 449 kB/s | 15 kB 00:00 Dependencies resolved. ======================================================================================================================================================= Package Arch Version Repository Size ======================================================================================================================================================= Installing: cmake aarch64 3.30.5-1.fc39 updates 7.7 M cuda-cudart-devel-12-6 aarch64 12.6.77-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 2.0 M cuda-driver-devel-12-6 aarch64 12.6.77-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 43 k cuda-gcc-11-c++ aarch64 11.2.1-1.fc39 copr_base 13 M cuda-nvcc-12-6 aarch64 12.6.85-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 62 M cuda-nvml-devel-12-6 aarch64 12.6.77-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 231 k cuda-nvrtc-devel-12-6 aarch64 12.6.85-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 28 M cuda-nvtx-12-6 aarch64 12.6.77-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 89 k doxygen aarch64 2:1.9.7-3.fc39 fedora 4.8 M gcc-c++ aarch64 13.3.1-3.fc39 updates 12 M git aarch64 2.47.0-1.fc39 updates 51 k graphviz aarch64 8.1.0-6.fc39 updates 4.9 M libcublas-devel-12-6 aarch64 12.6.4.1-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 417 M libcudnn9-devel-cuda-12 aarch64 9.5.1.17-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 53 k libcurand-devel-12-6 aarch64 10.3.7.77-2 copr_rezso_CUDA 248 k python3-devel aarch64 3.12.7-1.fc39 updates 314 k python3-setuptools noarch 67.7.2-8.fc39 updates 1.5 M Installing dependencies: abattis-cantarell-vf-fonts noarch 0.301-10.fc39 fedora 121 k adobe-mappings-cmap noarch 20231115-1.fc39 updates 2.2 M adobe-mappings-cmap-deprecated noarch 20231115-1.fc39 updates 111 k adobe-mappings-pdf noarch 20190401-5.fc39 fedora 698 k annobin-docs noarch 12.60-1.fc39 updates 88 k annobin-plugin-gcc aarch64 12.60-1.fc39 updates 964 k avahi-libs aarch64 0.8-24.fc39 fedora 67 k cairo aarch64 1.18.0-1.fc39 fedora 692 k cairo-gobject aarch64 1.18.0-1.fc39 fedora 18 k clang16-libs aarch64 16.0.6-3.fc39 fedora 21 M clang16-resource-filesystem aarch64 16.0.6-3.fc39 fedora 13 k cmake-data noarch 3.30.5-1.fc39 updates 2.3 M cmake-filesystem aarch64 3.30.5-1.fc39 updates 17 k cmake-rpm-macros noarch 3.30.5-1.fc39 updates 17 k cpp aarch64 13.3.1-3.fc39 updates 9.6 M crypto-policies-scripts noarch 20231204-1.git1e3a2e4.fc39 updates 117 k cuda-cccl-12-6 aarch64 12.6.77-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 1.6 M cuda-crt-12-6 aarch64 12.6.85-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 110 k cuda-cudart-12-6 aarch64 12.6.77-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 236 k cuda-gcc-11 aarch64 11.2.1-1.fc39 copr_base 27 M cuda-nvrtc-12-6 aarch64 12.6.85-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 22 M cuda-nvvm-12-6 aarch64 12.6.85-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 23 M cuda-toolkit-12-6-config-common noarch 12.6.77-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_x86_64 7.7 k cuda-toolkit-12-config-common noarch 12.6.77-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_x86_64 7.9 k cuda-toolkit-config-common noarch 12.6.77-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_x86_64 7.9 k cups-libs aarch64 1:2.4.11-1.fc39 updates 268 k dbus-libs aarch64 1:1.14.10-1.fc39 fedora 156 k default-fonts-core-sans noarch 4.0-9.fc39 fedora 32 k emacs-filesystem noarch 1:29.4-2.fc39 updates 7.3 k expat aarch64 2.6.3-1.fc39 updates 112 k fontconfig aarch64 2.14.2-6.fc39 updates 302 k fonts-filesystem noarch 1:2.0.5-12.fc39 fedora 8.2 k freetype aarch64 2.13.1-2.fc39 fedora 406 k fribidi aarch64 1.0.13-2.fc39 fedora 91 k gc aarch64 8.2.2-4.fc39 fedora 110 k gcc aarch64 13.3.1-3.fc39 updates 31 M gcc-plugin-annobin aarch64 13.3.1-3.fc39 updates 58 k gd aarch64 2.3.3-12.fc39 fedora 133 k gdk-pixbuf2 aarch64 2.42.10-5.fc39 fedora 482 k git-core aarch64 2.47.0-1.fc39 updates 4.9 M git-core-doc noarch 2.47.0-1.fc39 updates 3.0 M glib2 aarch64 2.78.6-1.fc39 updates 2.8 M glibc-devel aarch64 2.38-99.fc39 copr_base 498 k gnutls aarch64 3.8.6-1.fc39 updates 1.1 M google-droid-sans-fonts noarch 20200215-17.fc39 fedora 2.7 M google-noto-fonts-common noarch 20240101-1.fc39 updates 17 k google-noto-sans-vf-fonts noarch 20240101-1.fc39 updates 593 k graphite2 aarch64 1.3.14-12.fc39 fedora 93 k groff-base aarch64 1.23.0-3.fc39 updates 1.1 M gts aarch64 0.7.6-46.20121130.fc39 fedora 234 k guile22 aarch64 2.2.7-9.fc39 fedora 6.5 M harfbuzz aarch64 8.2.1-2.fc39 fedora 934 k highway aarch64 1.1.0-1.fc39 updates 97 k isl aarch64 0.16.1-18.fc39 fedora 838 k jbig2dec-libs aarch64 0.19-10.fc39 fedora 71 k jbigkit-libs aarch64 2.1-26.fc39 fedora 53 k jsoncpp aarch64 1.9.5-5.fc39 fedora 91 k kernel-headers aarch64 6.11.3-100.fc39 updates 1.6 M lasi aarch64 1.1.3-11.fc39 fedora 53 k lcms2 aarch64 2.15-2.fc39 fedora 176 k less aarch64 633-4.fc39 updates 176 k libICE aarch64 1.0.10-11.fc39 fedora 70 k libSM aarch64 1.2.3-13.fc39 fedora 41 k libX11 aarch64 1.8.9-1.fc39 updates 639 k libX11-common noarch 1.8.9-1.fc39 updates 176 k libXau aarch64 1.0.11-3.fc39 fedora 32 k libXext aarch64 1.3.5-3.fc39 fedora 39 k libXft aarch64 2.3.8-3.fc39 fedora 71 k libXpm aarch64 3.5.17-1.fc39 updates 64 k libXrender aarch64 0.9.11-3.fc39 fedora 27 k libXt aarch64 1.2.1-5.fc39 fedora 176 k libaom aarch64 3.9.0-1.fc39 updates 1.5 M libasan aarch64 13.3.1-3.fc39 updates 459 k libatomic aarch64 13.3.1-3.fc39 updates 47 k libavif aarch64 0.11.1-11.fc39 fedora 80 k libb2 aarch64 0.98.1-9.fc39 fedora 24 k libcbor aarch64 0.10.2-2.fc39 fedora 57 k libcublas-12-6 aarch64 12.6.4.1-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 372 M libcudnn9-cuda-12 aarch64 9.5.1.17-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 543 M libcurand-12-6 aarch64 10.3.7.77-1 http_developer_download_nvidia_com_compute_cuda_repos_rhel9_sbsa 53 M libdatrie aarch64 0.2.13-7.fc39 fedora 32 k libdav1d aarch64 1.2.1-2.fc39 fedora 350 k libedit aarch64 3.1-53.20240808cvs.fc39 updates 107 k libfido2 aarch64 1.13.0-3.fc39 fedora 96 k libgs aarch64 10.02.1-7.fc39 updates 3.4 M libijs aarch64 0.35-19.fc39 fedora 29 k libimagequant aarch64 4.0.3-5.fc39 updates 286 k libjpeg-turbo aarch64 2.1.4-3.fc39 fedora 196 k libjxl aarch64 1:0.8.3-1.fc39 updates 775 k liblerc aarch64 4.0.0-4.fc39 fedora 179 k libmpc aarch64 1.3.1-3.fc39 fedora 72 k libpaper aarch64 1:2.1.1-1.fc39 fedora 27 k libpng aarch64 2:1.6.37-15.fc39 fedora 115 k librsvg2 aarch64 2.57.1-2.fc39 updates 1.5 M libstdc++-devel aarch64 13.3.1-3.fc39 updates 2.6 M libthai aarch64 0.1.29-6.fc39 fedora 213 k libtiff aarch64 4.4.0-10.fc39 updates 196 k libubsan aarch64 13.3.1-3.fc39 updates 214 k libuv aarch64 1:1.49.2-1.fc39 updates 256 k libwebp aarch64 1.3.2-2.fc39 fedora 243 k libxcb aarch64 1.13.1-12.fc39 fedora 238 k libxcrypt-devel aarch64 4.4.36-2.fc39 fedora 30 k llvm16-libs aarch64 16.0.6-5.fc39 fedora 25 M make aarch64 1:4.4.1-2.fc39 fedora 585 k mpdecimal aarch64 2.5.1-7.fc39 fedora 90 k ncurses aarch64 6.4-7.20230520.fc39.1 updates 414 k netpbm aarch64 11.02.00-2.fc39 fedora 183 k nettle aarch64 3.9.1-2.fc39 fedora 434 k nspr aarch64 4.35.0-24.fc39 updates 135 k nss aarch64 3.105.0-1.fc39 updates 703 k nss-softokn aarch64 3.105.0-1.fc39 updates 421 k nss-softokn-freebl aarch64 3.105.0-1.fc39 updates 301 k nss-sysinit aarch64 3.105.0-1.fc39 updates 18 k nss-util aarch64 3.105.0-1.fc39 updates 87 k openjpeg2 aarch64 2.5.2-1.fc39 updates 176 k openssh aarch64 9.3p1-11.fc39 updates 431 k openssh-clients aarch64 9.3p1-11.fc39 updates 731 k pango aarch64 1.51.0-1.fc39 fedora 339 k perl-AutoLoader noarch 5.74-502.fc39 updates 21 k perl-B aarch64 1.88-502.fc39 updates 178 k perl-Carp noarch 1.54-500.fc39 fedora 29 k perl-Class-Struct noarch 0.68-502.fc39 updates 22 k perl-Data-Dumper aarch64 2.188-501.fc39 fedora 55 k perl-Digest noarch 1.20-500.fc39 fedora 25 k perl-Digest-MD5 aarch64 2.58-500.fc39 fedora 36 k perl-DynaLoader aarch64 1.54-502.fc39 updates 26 k perl-Encode aarch64 4:3.19-500.fc39 fedora 1.7 M perl-Errno aarch64 1.37-502.fc39 updates 15 k perl-Error noarch 1:0.17029-13.fc39 fedora 40 k perl-Exporter noarch 5.77-500.fc39 fedora 31 k perl-Fcntl aarch64 1.15-502.fc39 updates 21 k perl-File-Basename noarch 2.86-502.fc39 updates 17 k perl-File-Find noarch 1.43-502.fc39 updates 25 k perl-File-Path noarch 2.18-500.fc39 fedora 35 k perl-File-Temp noarch 1:0.231.100-500.fc39 fedora 58 k perl-File-stat noarch 1.13-502.fc39 updates 17 k perl-FileHandle noarch 2.05-502.fc39 updates 16 k perl-Getopt-Long noarch 1:2.54-500.fc39 fedora 60 k perl-Getopt-Std noarch 1.13-502.fc39 updates 16 k perl-Git noarch 2.47.0-1.fc39 updates 38 k perl-HTTP-Tiny noarch 0.088-3.fc39 fedora 56 k perl-IO aarch64 1.52-502.fc39 updates 83 k perl-IO-Socket-IP noarch 0.42-1.fc39 fedora 42 k perl-IO-Socket-SSL noarch 2.083-3.fc39 fedora 225 k perl-IPC-Open3 noarch 1.22-502.fc39 updates 22 k perl-MIME-Base64 aarch64 3.16-500.fc39 fedora 30 k perl-Mozilla-CA noarch 20230801-1.fc39 fedora 13 k perl-Net-SSLeay aarch64 1.92-10.fc39 fedora 356 k perl-POSIX aarch64 2.13-502.fc39 updates 98 k perl-PathTools aarch64 3.89-500.fc39 fedora 88 k perl-Pod-Escapes noarch 1:1.07-500.fc39 fedora 20 k perl-Pod-Perldoc noarch 3.28.01-501.fc39 fedora 86 k perl-Pod-Simple noarch 1:3.45-4.fc39 fedora 218 k perl-Pod-Usage noarch 4:2.03-500.fc39 fedora 39 k perl-Scalar-List-Utils aarch64 5:1.63-500.fc39 fedora 71 k perl-SelectSaver noarch 1.02-502.fc39 updates 12 k perl-Socket aarch64 4:2.037-3.fc39 fedora 56 k perl-Storable aarch64 1:3.32-500.fc39 fedora 97 k perl-Symbol noarch 1.09-502.fc39 updates 14 k perl-Term-ANSIColor noarch 5.01-501.fc39 fedora 47 k perl-Term-Cap noarch 1.18-500.fc39 fedora 22 k perl-TermReadKey aarch64 2.38-18.fc39 fedora 35 k perl-Text-ParseWords noarch 3.31-500.fc39 fedora 16 k perl-Text-Tabs+Wrap noarch 2023.0511-3.fc39 fedora 22 k perl-Time-Local noarch 2:1.350-3.fc39 fedora 34 k perl-URI noarch 5.21-1.fc39 fedora 125 k perl-base noarch 2.27-502.fc39 updates 16 k perl-constant noarch 1.33-501.fc39 fedora 22 k perl-if noarch 0.61.000-502.fc39 updates 14 k perl-interpreter aarch64 4:5.38.2-502.fc39 updates 72 k perl-lib aarch64 0.65-502.fc39 updates 15 k perl-libnet noarch 3.15-501.fc39 fedora 129 k perl-libs aarch64 4:5.38.2-502.fc39 updates 2.3 M perl-locale noarch 1.10-502.fc39 updates 14 k perl-mro aarch64 1.28-502.fc39 updates 29 k perl-overload noarch 1.37-502.fc39 updates 46 k perl-overloading noarch 0.02-502.fc39 updates 13 k perl-parent noarch 1:0.241-500.fc39 fedora 14 k perl-podlators noarch 1:5.01-500.fc39 fedora 125 k perl-vars noarch 1.05-502.fc39 updates 13 k pixman aarch64 0.42.2-2.fc39 fedora 216 k poppler aarch64 23.08.0-1.fc39 fedora 1.1 M poppler-data noarch 0.4.11-5.fc39 fedora 2.0 M poppler-glib aarch64 23.08.0-1.fc39 fedora 178 k pyproject-rpm-macros noarch 1.16.0-1.fc39 updates 45 k python-pip-wheel noarch 23.2.1-2.fc39 updates 1.5 M python-rpm-macros noarch 3.12-8.fc39 updates 18 k python3 aarch64 3.12.7-1.fc39 updates 28 k python3-libs aarch64 3.12.7-1.fc39 updates 9.1 M python3-packaging noarch 23.1-4.fc39 fedora 114 k python3-rpm-generators noarch 14-7.fc39 fedora 30 k python3-rpm-macros noarch 3.12-8.fc39 updates 12 k rav1e-libs aarch64 0.7.1-4.fc39 updates 788 k rhash aarch64 1.4.3-3.fc39 fedora 192 k rsvg-pixbuf-loader aarch64 2.57.1-2.fc39 updates 16 k shared-mime-info aarch64 2.2-4.fc39 fedora 380 k svt-av1-libs aarch64 1.4.1-3.fc39 fedora 1.0 M tzdata noarch 2024a-2.fc39 updates 715 k urw-base35-bookman-fonts noarch 20200910-20.fc39 updates 847 k urw-base35-c059-fonts noarch 20200910-20.fc39 updates 874 k urw-base35-d050000l-fonts noarch 20200910-20.fc39 updates 76 k urw-base35-fonts noarch 20200910-20.fc39 updates 10 k urw-base35-fonts-common noarch 20200910-20.fc39 updates 21 k urw-base35-gothic-fonts noarch 20200910-20.fc39 updates 643 k urw-base35-nimbus-mono-ps-fonts noarch 20200910-20.fc39 updates 795 k urw-base35-nimbus-roman-fonts noarch 20200910-20.fc39 updates 856 k urw-base35-nimbus-sans-fonts noarch 20200910-20.fc39 updates 1.3 M urw-base35-p052-fonts noarch 20200910-20.fc39 updates 973 k urw-base35-standard-symbols-ps-fonts noarch 20200910-20.fc39 updates 58 k urw-base35-z003-fonts noarch 20200910-20.fc39 updates 276 k vim-filesystem noarch 2:9.1.825-1.fc39 updates 16 k xapian-core-libs aarch64 1.4.26-1.fc39 updates 707 k xml-common noarch 0.6.3-61.fc39 fedora 31 k Transaction Summary ======================================================================================================================================================= Install 230 Packages Total download size: 1.7 G Installed size: 3.6 G Downloading Packages: (1/230): glibc-devel-2.38-99.fc39.aarch64.rpm 8.4 MB/s | 498 kB 00:00 (2/230): libcurand-devel-12-6-10.3.7.77-2.aarch 56 MB/s | 248 kB 00:00 (3/230): cuda-toolkit-12-6-config-common-12.6.7 1.6 MB/s | 7.7 kB 00:00 (4/230): cuda-toolkit-12-config-common-12.6.77- 1.4 MB/s | 7.9 kB 00:00 (5/230): cuda-toolkit-config-common-12.6.77-1.n 3.0 MB/s | 7.9 kB 00:00 (6/230): cuda-cccl-12-6-12.6.77-1.aarch64.rpm 191 MB/s | 1.6 MB 00:00 (7/230): cuda-crt-12-6-12.6.85-1.aarch64.rpm 15 MB/s | 110 kB 00:00 (8/230): cuda-cudart-12-6-12.6.77-1.aarch64.rpm 10 MB/s | 236 kB 00:00 (9/230): cuda-gcc-11-c++-11.2.1-1.fc39.aarch64. 91 MB/s | 13 MB 00:00 (10/230): cuda-driver-devel-12-6-12.6.77-1.aarc 6.2 MB/s | 43 kB 00:00 (11/230): cuda-cudart-devel-12-6-12.6.77-1.aarc 54 MB/s | 2.0 MB 00:00 (12/230): cuda-nvml-devel-12-6-12.6.77-1.aarch6 65 MB/s | 231 kB 00:00 (13/230): cuda-nvrtc-12-6-12.6.85-1.aarch64.rpm 336 MB/s | 22 MB 00:00 (14/230): cuda-nvrtc-devel-12-6-12.6.85-1.aarch 343 MB/s | 28 MB 00:00 (15/230): cuda-gcc-11-11.2.1-1.fc39.aarch64.rpm 79 MB/s | 27 MB 00:00 (16/230): cuda-nvtx-12-6-12.6.77-1.aarch64.rpm 3.3 MB/s | 89 kB 00:00 (17/230): cuda-nvcc-12-6-12.6.85-1.aarch64.rpm 244 MB/s | 62 MB 00:00 (18/230): cuda-nvvm-12-6-12.6.85-1.aarch64.rpm 182 MB/s | 23 MB 00:00 (19/230): libcublas-12-6-12.6.4.1-1.aarch64.rpm 299 MB/s | 372 MB 00:01 (20/230): libcudnn9-devel-cuda-12-9.5.1.17-1.aa 18 MB/s | 53 kB 00:00 (21/230): libcurand-12-6-10.3.7.77-1.aarch64.rp 316 MB/s | 53 MB 00:00 (22/230): abattis-cantarell-vf-fonts-0.301-10.f 6.6 MB/s | 121 kB 00:00 (23/230): adobe-mappings-pdf-20190401-5.fc39.no 78 MB/s | 698 kB 00:00 (24/230): avahi-libs-0.8-24.fc39.aarch64.rpm 24 MB/s | 67 kB 00:00 (25/230): cairo-1.18.0-1.fc39.aarch64.rpm 118 MB/s | 692 kB 00:00 (26/230): cairo-gobject-1.18.0-1.fc39.aarch64.r 1.1 MB/s | 18 kB 00:00 (27/230): libcublas-devel-12-6-12.6.4.1-1.aarch 227 MB/s | 417 MB 00:01 (28/230): clang16-resource-filesystem-16.0.6-3. 466 kB/s | 13 kB 00:00 (29/230): libcudnn9-cuda-12-9.5.1.17-1.aarch64. 235 MB/s | 543 MB 00:02 (30/230): dbus-libs-1.14.10-1.fc39.aarch64.rpm 308 kB/s | 156 kB 00:00 (31/230): default-fonts-core-sans-4.0-9.fc39.no 6.1 MB/s | 32 kB 00:00 (32/230): clang16-libs-16.0.6-3.fc39.aarch64.rp 21 MB/s | 21 MB 00:00 (33/230): fonts-filesystem-2.0.5-12.fc39.noarch 395 kB/s | 8.2 kB 00:00 (34/230): fribidi-1.0.13-2.fc39.aarch64.rpm 34 MB/s | 91 kB 00:00 (35/230): freetype-2.13.1-2.fc39.aarch64.rpm 94 MB/s | 406 kB 00:00 (36/230): gd-2.3.3-12.fc39.aarch64.rpm 37 MB/s | 133 kB 00:00 (37/230): gc-8.2.2-4.fc39.aarch64.rpm 20 MB/s | 110 kB 00:00 (38/230): gdk-pixbuf2-2.42.10-5.fc39.aarch64.rp 144 MB/s | 482 kB 00:00 (39/230): graphite2-1.3.14-12.fc39.aarch64.rpm 34 MB/s | 93 kB 00:00 (40/230): gts-0.7.6-46.20121130.fc39.aarch64.rp 70 MB/s | 234 kB 00:00 (41/230): google-droid-sans-fonts-20200215-17.f 203 MB/s | 2.7 MB 00:00 (42/230): harfbuzz-8.2.1-2.fc39.aarch64.rpm 146 MB/s | 934 kB 00:00 (43/230): isl-0.16.1-18.fc39.aarch64.rpm 110 MB/s | 838 kB 00:00 (44/230): guile22-2.2.7-9.fc39.aarch64.rpm 238 MB/s | 6.5 MB 00:00 (45/230): jbig2dec-libs-0.19-10.fc39.aarch64.rp 8.7 MB/s | 71 kB 00:00 (46/230): jbigkit-libs-2.1-26.fc39.aarch64.rpm 28 MB/s | 53 kB 00:00 (47/230): jsoncpp-1.9.5-5.fc39.aarch64.rpm 39 MB/s | 91 kB 00:00 (48/230): lasi-1.1.3-11.fc39.aarch64.rpm 12 MB/s | 53 kB 00:00 (49/230): lcms2-2.15-2.fc39.aarch64.rpm 34 MB/s | 176 kB 00:00 (50/230): libICE-1.0.10-11.fc39.aarch64.rpm 12 MB/s | 70 kB 00:00 (51/230): libSM-1.2.3-13.fc39.aarch64.rpm 8.2 MB/s | 41 kB 00:00 (52/230): libXau-1.0.11-3.fc39.aarch64.rpm 8.8 MB/s | 32 kB 00:00 (53/230): doxygen-1.9.7-3.fc39.aarch64.rpm 46 MB/s | 4.8 MB 00:00 (54/230): libXrender-0.9.11-3.fc39.aarch64.rpm 13 MB/s | 27 kB 00:00 (55/230): libXft-2.3.8-3.fc39.aarch64.rpm 4.0 MB/s | 71 kB 00:00 (56/230): libXt-1.2.1-5.fc39.aarch64.rpm 69 MB/s | 176 kB 00:00 (57/230): libavif-0.11.1-11.fc39.aarch64.rpm 32 MB/s | 80 kB 00:00 (58/230): libXext-1.3.5-3.fc39.aarch64.rpm 1.5 MB/s | 39 kB 00:00 (59/230): libb2-0.98.1-9.fc39.aarch64.rpm 7.6 MB/s | 24 kB 00:00 (60/230): libcbor-0.10.2-2.fc39.aarch64.rpm 16 MB/s | 57 kB 00:00 (61/230): libdatrie-0.2.13-7.fc39.aarch64.rpm 11 MB/s | 32 kB 00:00 (62/230): libdav1d-1.2.1-2.fc39.aarch64.rpm 107 MB/s | 350 kB 00:00 (63/230): libijs-0.35-19.fc39.aarch64.rpm 12 MB/s | 29 kB 00:00 (64/230): libjpeg-turbo-2.1.4-3.fc39.aarch64.rp 78 MB/s | 196 kB 00:00 (65/230): liblerc-4.0.0-4.fc39.aarch64.rpm 64 MB/s | 179 kB 00:00 (66/230): libmpc-1.3.1-3.fc39.aarch64.rpm 24 MB/s | 72 kB 00:00 (67/230): libfido2-1.13.0-3.fc39.aarch64.rpm 11 MB/s | 96 kB 00:00 (68/230): libpaper-2.1.1-1.fc39.aarch64.rpm 7.7 MB/s | 27 kB 00:00 (69/230): libpng-1.6.37-15.fc39.aarch64.rpm 39 MB/s | 115 kB 00:00 (70/230): libthai-0.1.29-6.fc39.aarch64.rpm 55 MB/s | 213 kB 00:00 (71/230): libwebp-1.3.2-2.fc39.aarch64.rpm 67 MB/s | 243 kB 00:00 (72/230): libxcrypt-devel-4.4.36-2.fc39.aarch64 12 MB/s | 30 kB 00:00 (73/230): make-4.4.1-2.fc39.aarch64.rpm 128 MB/s | 585 kB 00:00 (74/230): mpdecimal-2.5.1-7.fc39.aarch64.rpm 41 MB/s | 90 kB 00:00 (75/230): libxcb-1.13.1-12.fc39.aarch64.rpm 17 MB/s | 238 kB 00:00 (76/230): netpbm-11.02.00-2.fc39.aarch64.rpm 50 MB/s | 183 kB 00:00 (77/230): pango-1.51.0-1.fc39.aarch64.rpm 108 MB/s | 339 kB 00:00 (78/230): nettle-3.9.1-2.fc39.aarch64.rpm 76 MB/s | 434 kB 00:00 (79/230): perl-Carp-1.54-500.fc39.noarch.rpm 19 MB/s | 29 kB 00:00 (80/230): perl-Data-Dumper-2.188-501.fc39.aarch 26 MB/s | 55 kB 00:00 (81/230): perl-Digest-1.20-500.fc39.noarch.rpm 13 MB/s | 25 kB 00:00 (82/230): perl-Digest-MD5-2.58-500.fc39.aarch64 18 MB/s | 36 kB 00:00 (83/230): perl-Error-0.17029-13.fc39.noarch.rpm 13 MB/s | 40 kB 00:00 (84/230): perl-Encode-3.19-500.fc39.aarch64.rpm 203 MB/s | 1.7 MB 00:00 (85/230): perl-Exporter-5.77-500.fc39.noarch.rp 8.0 MB/s | 31 kB 00:00 (86/230): perl-File-Path-2.18-500.fc39.noarch.r 13 MB/s | 35 kB 00:00 (87/230): perl-File-Temp-0.231.100-500.fc39.noa 15 MB/s | 58 kB 00:00 (88/230): perl-Getopt-Long-2.54-500.fc39.noarch 30 MB/s | 60 kB 00:00 (89/230): perl-HTTP-Tiny-0.088-3.fc39.noarch.rp 28 MB/s | 56 kB 00:00 (90/230): perl-IO-Socket-IP-0.42-1.fc39.noarch. 22 MB/s | 42 kB 00:00 (91/230): perl-MIME-Base64-3.16-500.fc39.aarch6 18 MB/s | 30 kB 00:00 (92/230): perl-IO-Socket-SSL-2.083-3.fc39.noarc 74 MB/s | 225 kB 00:00 (93/230): perl-Mozilla-CA-20230801-1.fc39.noarc 11 MB/s | 13 kB 00:00 (94/230): perl-PathTools-3.89-500.fc39.aarch64. 38 MB/s | 88 kB 00:00 (95/230): perl-Pod-Escapes-1.07-500.fc39.noarch 11 MB/s | 20 kB 00:00 (96/230): perl-Pod-Perldoc-3.28.01-501.fc39.noa 41 MB/s | 86 kB 00:00 (97/230): perl-Pod-Simple-3.45-4.fc39.noarch.rp 73 MB/s | 218 kB 00:00 (98/230): perl-Pod-Usage-2.03-500.fc39.noarch.r 17 MB/s | 39 kB 00:00 (99/230): perl-Net-SSLeay-1.92-10.fc39.aarch64. 23 MB/s | 356 kB 00:00 (100/230): perl-Scalar-List-Utils-1.63-500.fc39 19 MB/s | 71 kB 00:00 (101/230): perl-Socket-2.037-3.fc39.aarch64.rpm 17 MB/s | 56 kB 00:00 (102/230): perl-Storable-3.32-500.fc39.aarch64. 27 MB/s | 97 kB 00:00 (103/230): perl-Term-ANSIColor-5.01-501.fc39.no 12 MB/s | 47 kB 00:00 (104/230): perl-Term-Cap-1.18-500.fc39.noarch.r 4.7 MB/s | 22 kB 00:00 (105/230): perl-TermReadKey-2.38-18.fc39.aarch6 9.7 MB/s | 35 kB 00:00 (106/230): perl-Text-ParseWords-3.31-500.fc39.n 5.7 MB/s | 16 kB 00:00 (107/230): perl-Text-Tabs+Wrap-2023.0511-3.fc39 8.2 MB/s | 22 kB 00:00 (108/230): perl-Time-Local-1.350-3.fc39.noarch. 15 MB/s | 34 kB 00:00 (109/230): perl-URI-5.21-1.fc39.noarch.rpm 42 MB/s | 125 kB 00:00 (110/230): perl-constant-1.33-501.fc39.noarch.r 5.0 MB/s | 22 kB 00:00 (111/230): perl-libnet-3.15-501.fc39.noarch.rpm 30 MB/s | 129 kB 00:00 (112/230): perl-parent-0.241-500.fc39.noarch.rp 1.3 MB/s | 14 kB 00:00 (113/230): perl-podlators-5.01-500.fc39.noarch. 11 MB/s | 125 kB 00:00 (114/230): pixman-0.42.2-2.fc39.aarch64.rpm 19 MB/s | 216 kB 00:00 (115/230): poppler-23.08.0-1.fc39.aarch64.rpm 64 MB/s | 1.1 MB 00:00 (116/230): poppler-glib-23.08.0-1.fc39.aarch64. 30 MB/s | 178 kB 00:00 (117/230): python3-packaging-23.1-4.fc39.noarch 21 MB/s | 114 kB 00:00 (118/230): poppler-data-0.4.11-5.fc39.noarch.rp 89 MB/s | 2.0 MB 00:00 (119/230): python3-rpm-generators-14-7.fc39.noa 7.9 MB/s | 30 kB 00:00 (120/230): rhash-1.4.3-3.fc39.aarch64.rpm 31 MB/s | 192 kB 00:00 (121/230): shared-mime-info-2.2-4.fc39.aarch64. 48 MB/s | 380 kB 00:00 (122/230): svt-av1-libs-1.4.1-3.fc39.aarch64.rp 166 MB/s | 1.0 MB 00:00 (123/230): xml-common-0.6.3-61.fc39.noarch.rpm 5.5 MB/s | 31 kB 00:00 (124/230): adobe-mappings-cmap-deprecated-20231 36 MB/s | 111 kB 00:00 (125/230): annobin-docs-12.60-1.fc39.noarch.rpm 12 MB/s | 88 kB 00:00 (126/230): adobe-mappings-cmap-20231115-1.fc39. 135 MB/s | 2.2 MB 00:00 (127/230): annobin-plugin-gcc-12.60-1.fc39.aarc 92 MB/s | 964 kB 00:00 (128/230): llvm16-libs-16.0.6-5.fc39.aarch64.rp 126 MB/s | 25 MB 00:00 (129/230): cmake-data-3.30.5-1.fc39.noarch.rpm 58 MB/s | 2.3 MB 00:00 (130/230): cmake-filesystem-3.30.5-1.fc39.aarch 2.8 MB/s | 17 kB 00:00 (131/230): cmake-3.30.5-1.fc39.aarch64.rpm 128 MB/s | 7.7 MB 00:00 (132/230): cmake-rpm-macros-3.30.5-1.fc39.noarc 1.2 MB/s | 17 kB 00:00 (133/230): crypto-policies-scripts-20231204-1.g 32 MB/s | 117 kB 00:00 (134/230): cups-libs-2.4.11-1.fc39.aarch64.rpm 55 MB/s | 268 kB 00:00 (135/230): emacs-filesystem-29.4-2.fc39.noarch. 1.9 MB/s | 7.3 kB 00:00 (136/230): expat-2.6.3-1.fc39.aarch64.rpm 27 MB/s | 112 kB 00:00 (137/230): fontconfig-2.14.2-6.fc39.aarch64.rpm 62 MB/s | 302 kB 00:00 (138/230): cpp-13.3.1-3.fc39.aarch64.rpm 267 MB/s | 9.6 MB 00:00 (139/230): gcc-plugin-annobin-13.3.1-3.fc39.aar 7.8 MB/s | 58 kB 00:00 (140/230): git-2.47.0-1.fc39.aarch64.rpm 6.6 MB/s | 51 kB 00:00 (141/230): git-core-2.47.0-1.fc39.aarch64.rpm 182 MB/s | 4.9 MB 00:00 (142/230): git-core-doc-2.47.0-1.fc39.noarch.rp 128 MB/s | 3.0 MB 00:00 (143/230): gcc-c++-13.3.1-3.fc39.aarch64.rpm 118 MB/s | 12 MB 00:00 (144/230): glib2-2.78.6-1.fc39.aarch64.rpm 106 MB/s | 2.8 MB 00:00 (145/230): gnutls-3.8.6-1.fc39.aarch64.rpm 45 MB/s | 1.1 MB 00:00 (146/230): google-noto-fonts-common-20240101-1. 1.1 MB/s | 17 kB 00:00 (147/230): gcc-13.3.1-3.fc39.aarch64.rpm 194 MB/s | 31 MB 00:00 (148/230): google-noto-sans-vf-fonts-20240101-1 16 MB/s | 593 kB 00:00 (149/230): graphviz-8.1.0-6.fc39.aarch64.rpm 108 MB/s | 4.9 MB 00:00 (150/230): groff-base-1.23.0-3.fc39.aarch64.rpm 90 MB/s | 1.1 MB 00:00 (151/230): highway-1.1.0-1.fc39.aarch64.rpm 10 MB/s | 97 kB 00:00 (152/230): less-633-4.fc39.aarch64.rpm 41 MB/s | 176 kB 00:00 (153/230): libX11-1.8.9-1.fc39.aarch64.rpm 101 MB/s | 639 kB 00:00 (154/230): kernel-headers-6.11.3-100.fc39.aarch 173 MB/s | 1.6 MB 00:00 (155/230): libX11-common-1.8.9-1.fc39.noarch.rp 41 MB/s | 176 kB 00:00 (156/230): libXpm-3.5.17-1.fc39.aarch64.rpm 17 MB/s | 64 kB 00:00 (157/230): libasan-13.3.1-3.fc39.aarch64.rpm 83 MB/s | 459 kB 00:00 (158/230): libedit-3.1-53.20240808cvs.fc39.aarc 14 MB/s | 107 kB 00:00 (159/230): libaom-3.9.0-1.fc39.aarch64.rpm 91 MB/s | 1.5 MB 00:00 (160/230): libatomic-13.3.1-3.fc39.aarch64.rpm 2.8 MB/s | 47 kB 00:00 (161/230): libimagequant-4.0.3-5.fc39.aarch64.r 38 MB/s | 286 kB 00:00 (162/230): libjxl-0.8.3-1.fc39.aarch64.rpm 84 MB/s | 775 kB 00:00 (163/230): libgs-10.02.1-7.fc39.aarch64.rpm 160 MB/s | 3.4 MB 00:00 (164/230): librsvg2-2.57.1-2.fc39.aarch64.rpm 111 MB/s | 1.5 MB 00:00 (165/230): libstdc++-devel-13.3.1-3.fc39.aarch6 183 MB/s | 2.6 MB 00:00 (166/230): libtiff-4.4.0-10.fc39.aarch64.rpm 26 MB/s | 196 kB 00:00 (167/230): libubsan-13.3.1-3.fc39.aarch64.rpm 45 MB/s | 214 kB 00:00 (168/230): libuv-1.49.2-1.fc39.aarch64.rpm 63 MB/s | 256 kB 00:00 (169/230): ncurses-6.4-7.20230520.fc39.1.aarch6 88 MB/s | 414 kB 00:00 (170/230): nspr-4.35.0-24.fc39.aarch64.rpm 31 MB/s | 135 kB 00:00 (171/230): nss-3.105.0-1.fc39.aarch64.rpm 163 MB/s | 703 kB 00:00 (172/230): nss-softokn-freebl-3.105.0-1.fc39.aa 64 MB/s | 301 kB 00:00 (173/230): nss-softokn-3.105.0-1.fc39.aarch64.r 72 MB/s | 421 kB 00:00 (174/230): nss-sysinit-3.105.0-1.fc39.aarch64.r 9.7 MB/s | 18 kB 00:00 (175/230): nss-util-3.105.0-1.fc39.aarch64.rpm 26 MB/s | 87 kB 00:00 (176/230): openjpeg2-2.5.2-1.fc39.aarch64.rpm 55 MB/s | 176 kB 00:00 (177/230): openssh-9.3p1-11.fc39.aarch64.rpm 98 MB/s | 431 kB 00:00 (178/230): perl-AutoLoader-5.74-502.fc39.noarch 6.0 MB/s | 21 kB 00:00 (179/230): perl-B-1.88-502.fc39.aarch64.rpm 48 MB/s | 178 kB 00:00 (180/230): perl-Class-Struct-0.68-502.fc39.noar 16 MB/s | 22 kB 00:00 (181/230): perl-DynaLoader-1.54-502.fc39.aarch6 19 MB/s | 26 kB 00:00 (182/230): perl-Errno-1.37-502.fc39.aarch64.rpm 7.7 MB/s | 15 kB 00:00 (183/230): perl-Fcntl-1.15-502.fc39.aarch64.rpm 13 MB/s | 21 kB 00:00 (184/230): perl-File-Basename-2.86-502.fc39.noa 9.0 MB/s | 17 kB 00:00 (185/230): perl-File-Find-1.43-502.fc39.noarch. 9.0 MB/s | 25 kB 00:00 (186/230): perl-File-stat-1.13-502.fc39.noarch. 10 MB/s | 17 kB 00:00 (187/230): perl-FileHandle-2.05-502.fc39.noarch 11 MB/s | 16 kB 00:00 (188/230): perl-Getopt-Std-1.13-502.fc39.noarch 10 MB/s | 16 kB 00:00 (189/230): perl-Git-2.47.0-1.fc39.noarch.rpm 18 MB/s | 38 kB 00:00 (190/230): perl-IO-1.52-502.fc39.aarch64.rpm 35 MB/s | 83 kB 00:00 (191/230): perl-IPC-Open3-1.22-502.fc39.noarch. 11 MB/s | 22 kB 00:00 (192/230): perl-POSIX-2.13-502.fc39.aarch64.rpm 46 MB/s | 98 kB 00:00 (193/230): perl-SelectSaver-1.02-502.fc39.noarc 7.8 MB/s | 12 kB 00:00 (194/230): perl-Symbol-1.09-502.fc39.noarch.rpm 7.3 MB/s | 14 kB 00:00 (195/230): perl-base-2.27-502.fc39.noarch.rpm 7.5 MB/s | 16 kB 00:00 (196/230): perl-if-0.61.000-502.fc39.noarch.rpm 7.9 MB/s | 14 kB 00:00 (197/230): perl-interpreter-5.38.2-502.fc39.aar 37 MB/s | 72 kB 00:00 (198/230): perl-lib-0.65-502.fc39.aarch64.rpm 6.4 MB/s | 15 kB 00:00 (199/230): perl-locale-1.10-502.fc39.noarch.rpm 9.3 MB/s | 14 kB 00:00 (200/230): perl-mro-1.28-502.fc39.aarch64.rpm 6.0 MB/s | 29 kB 00:00 (201/230): perl-libs-5.38.2-502.fc39.aarch64.rp 190 MB/s | 2.3 MB 00:00 (202/230): openssh-clients-9.3p1-11.fc39.aarch6 19 MB/s | 731 kB 00:00 (203/230): perl-overload-1.37-502.fc39.noarch.r 6.3 MB/s | 46 kB 00:00 (204/230): perl-overloading-0.02-502.fc39.noarc 4.1 MB/s | 13 kB 00:00 (205/230): perl-vars-1.05-502.fc39.noarch.rpm 5.8 MB/s | 13 kB 00:00 (206/230): pyproject-rpm-macros-1.16.0-1.fc39.n 20 MB/s | 45 kB 00:00 (207/230): python-rpm-macros-3.12-8.fc39.noarch 8.4 MB/s | 18 kB 00:00 (208/230): python3-3.12.7-1.fc39.aarch64.rpm 9.9 MB/s | 28 kB 00:00 (209/230): python3-devel-3.12.7-1.fc39.aarch64. 84 MB/s | 314 kB 00:00 (210/230): python-pip-wheel-23.2.1-2.fc39.noarc 174 MB/s | 1.5 MB 00:00 (211/230): python3-rpm-macros-3.12-8.fc39.noarc 6.0 MB/s | 12 kB 00:00 (212/230): rav1e-libs-0.7.1-4.fc39.aarch64.rpm 87 MB/s | 788 kB 00:00 (213/230): python3-setuptools-67.7.2-8.fc39.noa 109 MB/s | 1.5 MB 00:00 (214/230): rsvg-pixbuf-loader-2.57.1-2.fc39.aar 2.5 MB/s | 16 kB 00:00 (215/230): tzdata-2024a-2.fc39.noarch.rpm 104 MB/s | 715 kB 00:00 (216/230): urw-base35-bookman-fonts-20200910-20 110 MB/s | 847 kB 00:00 (217/230): urw-base35-d050000l-fonts-20200910-2 18 MB/s | 76 kB 00:00 (218/230): urw-base35-c059-fonts-20200910-20.fc 102 MB/s | 874 kB 00:00 (219/230): urw-base35-fonts-20200910-20.fc39.no 2.7 MB/s | 10 kB 00:00 (220/230): urw-base35-fonts-common-20200910-20. 6.7 MB/s | 21 kB 00:00 (221/230): python3-libs-3.12.7-1.fc39.aarch64.r 185 MB/s | 9.1 MB 00:00 (222/230): urw-base35-gothic-fonts-20200910-20. 42 MB/s | 643 kB 00:00 (223/230): urw-base35-nimbus-mono-ps-fonts-2020 64 MB/s | 795 kB 00:00 (224/230): urw-base35-nimbus-roman-fonts-202009 158 MB/s | 856 kB 00:00 (225/230): urw-base35-p052-fonts-20200910-20.fc 107 MB/s | 973 kB 00:00 (226/230): urw-base35-nimbus-sans-fonts-2020091 118 MB/s | 1.3 MB 00:00 (227/230): urw-base35-standard-symbols-ps-fonts 8.5 MB/s | 58 kB 00:00 (228/230): urw-base35-z003-fonts-20200910-20.fc 63 MB/s | 276 kB 00:00 (229/230): vim-filesystem-9.1.825-1.fc39.noarch 6.0 MB/s | 16 kB 00:00 (230/230): xapian-core-libs-1.4.26-1.fc39.aarch 79 MB/s | 707 kB 00:00 -------------------------------------------------------------------------------- Total 466 MB/s | 1.7 GB 00:03 Running transaction check Transaction check succeeded. Running transaction test Transaction test succeeded. Running transaction Preparing : 1/1 Installing : libpng-2:1.6.37-15.fc39.aarch64 1/230 Installing : nspr-4.35.0-24.fc39.aarch64 2/230 Installing : libjpeg-turbo-2.1.4-3.fc39.aarch64 3/230 Installing : fonts-filesystem-1:2.0.5-12.fc39.noarch 4/230 Installing : urw-base35-fonts-common-20200910-20.fc39.noarch 5/230 Installing : nss-util-3.105.0-1.fc39.aarch64 6/230 Installing : expat-2.6.3-1.fc39.aarch64 7/230 Installing : libmpc-1.3.1-3.fc39.aarch64 8/230 Installing : python-rpm-macros-3.12-8.fc39.noarch 9/230 Installing : libwebp-1.3.2-2.fc39.aarch64 10/230 Installing : cuda-toolkit-config-common-12.6.77-1.noarch 11/230 Installing : cuda-toolkit-12-config-common-12.6.77-1.noarch 12/230 Installing : cuda-toolkit-12-6-config-common-12.6.77-1.noarch 13/230 Installing : python3-rpm-macros-3.12-8.fc39.noarch 14/230 Installing : openjpeg2-2.5.2-1.fc39.aarch64 15/230 Installing : libedit-3.1-53.20240808cvs.fc39.aarch64 16/230 Installing : libatomic-13.3.1-3.fc39.aarch64 17/230 Installing : cmake-filesystem-3.30.5-1.fc39.aarch64 18/230 Installing : adobe-mappings-cmap-20231115-1.fc39.noarch 19/230 Installing : libICE-1.0.10-11.fc39.aarch64 20/230 Installing : lcms2-2.15-2.fc39.aarch64 21/230 Installing : libSM-1.2.3-13.fc39.aarch64 22/230 Installing : adobe-mappings-cmap-deprecated-20231115-1.fc39.n 23/230 Installing : llvm16-libs-16.0.6-5.fc39.aarch64 24/230 Installing : pyproject-rpm-macros-1.16.0-1.fc39.noarch 25/230 Installing : cuda-cudart-12-6-12.6.77-1.aarch64 26/230 Running scriptlet: cuda-cudart-12-6-12.6.77-1.aarch64 26/230 Installing : libcublas-12-6-12.6.4.1-1.aarch64 27/230 Running scriptlet: libcublas-12-6-12.6.4.1-1.aarch64 27/230 Installing : libcurand-12-6-10.3.7.77-1.aarch64 28/230 Running scriptlet: libcurand-12-6-10.3.7.77-1.aarch64 28/230 Installing : cuda-gcc-11-11.2.1-1.fc39.aarch64 29/230 Installing : cpp-13.3.1-3.fc39.aarch64 30/230 Installing : nss-softokn-freebl-3.105.0-1.fc39.aarch64 31/230 Installing : nss-softokn-3.105.0-1.fc39.aarch64 32/230 Installing : urw-base35-bookman-fonts-20200910-20.fc39.noarch 33/230 Running scriptlet: urw-base35-bookman-fonts-20200910-20.fc39.noarch 33/230 Installing : urw-base35-c059-fonts-20200910-20.fc39.noarch 34/230 Running scriptlet: urw-base35-c059-fonts-20200910-20.fc39.noarch 34/230 Installing : urw-base35-d050000l-fonts-20200910-20.fc39.noarc 35/230 Running scriptlet: urw-base35-d050000l-fonts-20200910-20.fc39.noarc 35/230 Installing : urw-base35-gothic-fonts-20200910-20.fc39.noarch 36/230 Running scriptlet: urw-base35-gothic-fonts-20200910-20.fc39.noarch 36/230 Installing : urw-base35-nimbus-mono-ps-fonts-20200910-20.fc39 37/230 Running scriptlet: urw-base35-nimbus-mono-ps-fonts-20200910-20.fc39 37/230 Installing : urw-base35-nimbus-roman-fonts-20200910-20.fc39.n 38/230 Running scriptlet: urw-base35-nimbus-roman-fonts-20200910-20.fc39.n 38/230 Installing : urw-base35-nimbus-sans-fonts-20200910-20.fc39.no 39/230 Running scriptlet: urw-base35-nimbus-sans-fonts-20200910-20.fc39.no 39/230 Installing : urw-base35-p052-fonts-20200910-20.fc39.noarch 40/230 Running scriptlet: urw-base35-p052-fonts-20200910-20.fc39.noarch 40/230 Installing : urw-base35-standard-symbols-ps-fonts-20200910-20 41/230 Running scriptlet: urw-base35-standard-symbols-ps-fonts-20200910-20 41/230 Installing : urw-base35-z003-fonts-20200910-20.fc39.noarch 42/230 Running scriptlet: urw-base35-z003-fonts-20200910-20.fc39.noarch 42/230 Installing : urw-base35-fonts-20200910-20.fc39.noarch 43/230 Installing : abattis-cantarell-vf-fonts-0.301-10.fc39.noarch 44/230 Installing : xapian-core-libs-1.4.26-1.fc39.aarch64 45/230 Installing : vim-filesystem-2:9.1.825-1.fc39.noarch 46/230 Installing : tzdata-2024a-2.fc39.noarch 47/230 Installing : rav1e-libs-0.7.1-4.fc39.aarch64 48/230 Installing : python-pip-wheel-23.2.1-2.fc39.noarch 49/230 Installing : openssh-9.3p1-11.fc39.aarch64 50/230 Installing : ncurses-6.4-7.20230520.fc39.1.aarch64 51/230 Installing : libuv-1:1.49.2-1.fc39.aarch64 52/230 Installing : libubsan-13.3.1-3.fc39.aarch64 53/230 Installing : libstdc++-devel-13.3.1-3.fc39.aarch64 54/230 Installing : libimagequant-4.0.3-5.fc39.aarch64 55/230 Installing : libasan-13.3.1-3.fc39.aarch64 56/230 Installing : libX11-common-1.8.9-1.fc39.noarch 57/230 Installing : less-633-4.fc39.aarch64 58/230 Installing : kernel-headers-6.11.3-100.fc39.aarch64 59/230 Installing : libxcrypt-devel-4.4.36-2.fc39.aarch64 60/230 Installing : glibc-devel-2.38-99.fc39.aarch64 61/230 Installing : highway-1.1.0-1.fc39.aarch64 62/230 Running scriptlet: groff-base-1.23.0-3.fc39.aarch64 63/230 Installing : groff-base-1.23.0-3.fc39.aarch64 63/230 Running scriptlet: groff-base-1.23.0-3.fc39.aarch64 63/230 Installing : perl-Digest-1.20-500.fc39.noarch 64/230 Installing : perl-Digest-MD5-2.58-500.fc39.aarch64 65/230 Installing : perl-B-1.88-502.fc39.aarch64 66/230 Installing : perl-FileHandle-2.05-502.fc39.noarch 67/230 Installing : perl-Data-Dumper-2.188-501.fc39.aarch64 68/230 Installing : perl-libnet-3.15-501.fc39.noarch 69/230 Installing : perl-AutoLoader-5.74-502.fc39.noarch 70/230 Installing : perl-base-2.27-502.fc39.noarch 71/230 Installing : perl-URI-5.21-1.fc39.noarch 72/230 Installing : perl-Pod-Escapes-1:1.07-500.fc39.noarch 73/230 Installing : perl-Text-Tabs+Wrap-2023.0511-3.fc39.noarch 74/230 Installing : perl-Time-Local-2:1.350-3.fc39.noarch 75/230 Installing : perl-Net-SSLeay-1.92-10.fc39.aarch64 76/230 Installing : perl-Mozilla-CA-20230801-1.fc39.noarch 77/230 Installing : perl-File-Path-2.18-500.fc39.noarch 78/230 Installing : perl-if-0.61.000-502.fc39.noarch 79/230 Installing : perl-locale-1.10-502.fc39.noarch 80/230 Installing : perl-IO-Socket-IP-0.42-1.fc39.noarch 81/230 Installing : perl-IO-Socket-SSL-2.083-3.fc39.noarch 82/230 Installing : perl-Term-ANSIColor-5.01-501.fc39.noarch 83/230 Installing : perl-Term-Cap-1.18-500.fc39.noarch 84/230 Installing : perl-Class-Struct-0.68-502.fc39.noarch 85/230 Installing : perl-POSIX-2.13-502.fc39.aarch64 86/230 Installing : perl-File-Temp-1:0.231.100-500.fc39.noarch 87/230 Installing : perl-HTTP-Tiny-0.088-3.fc39.noarch 88/230 Installing : perl-Pod-Simple-1:3.45-4.fc39.noarch 89/230 Installing : perl-IPC-Open3-1.22-502.fc39.noarch 90/230 Installing : perl-Socket-4:2.037-3.fc39.aarch64 91/230 Installing : perl-SelectSaver-1.02-502.fc39.noarch 92/230 Installing : perl-Symbol-1.09-502.fc39.noarch 93/230 Installing : perl-podlators-1:5.01-500.fc39.noarch 94/230 Installing : perl-Pod-Perldoc-3.28.01-501.fc39.noarch 95/230 Installing : perl-File-stat-1.13-502.fc39.noarch 96/230 Installing : perl-Text-ParseWords-3.31-500.fc39.noarch 97/230 Installing : perl-Fcntl-1.15-502.fc39.aarch64 98/230 Installing : perl-mro-1.28-502.fc39.aarch64 99/230 Installing : perl-Pod-Usage-4:2.03-500.fc39.noarch 100/230 Installing : perl-IO-1.52-502.fc39.aarch64 101/230 Installing : perl-overloading-0.02-502.fc39.noarch 102/230 Installing : perl-MIME-Base64-3.16-500.fc39.aarch64 103/230 Installing : perl-Scalar-List-Utils-5:1.63-500.fc39.aarch64 104/230 Installing : perl-constant-1.33-501.fc39.noarch 105/230 Installing : perl-parent-1:0.241-500.fc39.noarch 106/230 Installing : perl-Errno-1.37-502.fc39.aarch64 107/230 Installing : perl-File-Basename-2.86-502.fc39.noarch 108/230 Installing : perl-Getopt-Std-1.13-502.fc39.noarch 109/230 Installing : perl-Storable-1:3.32-500.fc39.aarch64 110/230 Installing : perl-Getopt-Long-1:2.54-500.fc39.noarch 111/230 Installing : perl-overload-1.37-502.fc39.noarch 112/230 Installing : perl-vars-1.05-502.fc39.noarch 113/230 Installing : perl-Exporter-5.77-500.fc39.noarch 114/230 Installing : perl-PathTools-3.89-500.fc39.aarch64 115/230 Installing : perl-Encode-4:3.19-500.fc39.aarch64 116/230 Installing : perl-DynaLoader-1.54-502.fc39.aarch64 117/230 Installing : perl-Carp-1.54-500.fc39.noarch 118/230 Installing : perl-libs-4:5.38.2-502.fc39.aarch64 119/230 Installing : perl-interpreter-4:5.38.2-502.fc39.aarch64 120/230 Installing : perl-Error-1:0.17029-13.fc39.noarch 121/230 Installing : perl-TermReadKey-2.38-18.fc39.aarch64 122/230 Installing : perl-File-Find-1.43-502.fc39.noarch 123/230 Installing : perl-lib-0.65-502.fc39.aarch64 124/230 Installing : google-noto-fonts-common-20240101-1.fc39.noarch 125/230 Installing : google-noto-sans-vf-fonts-20240101-1.fc39.noarch 126/230 Installing : default-fonts-core-sans-4.0-9.fc39.noarch 127/230 Installing : google-droid-sans-fonts-20200215-17.fc39.noarch 128/230 Installing : emacs-filesystem-1:29.4-2.fc39.noarch 129/230 Installing : annobin-docs-12.60-1.fc39.noarch 130/230 Running scriptlet: xml-common-0.6.3-61.fc39.noarch 131/230 Installing : xml-common-0.6.3-61.fc39.noarch 131/230 Installing : svt-av1-libs-1.4.1-3.fc39.aarch64 132/230 Installing : rhash-1.4.3-3.fc39.aarch64 133/230 Installing : poppler-data-0.4.11-5.fc39.noarch 134/230 Installing : pixman-0.42.2-2.fc39.aarch64 135/230 Installing : nettle-3.9.1-2.fc39.aarch64 136/230 Installing : gnutls-3.8.6-1.fc39.aarch64 137/230 Installing : glib2-2.78.6-1.fc39.aarch64 138/230 Installing : shared-mime-info-2.2-4.fc39.aarch64 139/230 Running scriptlet: shared-mime-info-2.2-4.fc39.aarch64 139/230 Installing : gdk-pixbuf2-2.42.10-5.fc39.aarch64 140/230 Installing : libjxl-1:0.8.3-1.fc39.aarch64 141/230 Installing : libaom-3.9.0-1.fc39.aarch64 142/230 Installing : netpbm-11.02.00-2.fc39.aarch64 143/230 Installing : gts-0.7.6-46.20121130.fc39.aarch64 144/230 Installing : mpdecimal-2.5.1-7.fc39.aarch64 145/230 Installing : libpaper-1:2.1.1-1.fc39.aarch64 146/230 Installing : liblerc-4.0.0-4.fc39.aarch64 147/230 Installing : libijs-0.35-19.fc39.aarch64 148/230 Installing : libdav1d-1.2.1-2.fc39.aarch64 149/230 Installing : libavif-0.11.1-11.fc39.aarch64 150/230 Installing : libdatrie-0.2.13-7.fc39.aarch64 151/230 Installing : libthai-0.1.29-6.fc39.aarch64 152/230 Installing : libcbor-0.10.2-2.fc39.aarch64 153/230 Installing : libfido2-1.13.0-3.fc39.aarch64 154/230 Installing : openssh-clients-9.3p1-11.fc39.aarch64 155/230 Running scriptlet: openssh-clients-9.3p1-11.fc39.aarch64 155/230 Installing : git-core-2.47.0-1.fc39.aarch64 156/230 Installing : git-core-doc-2.47.0-1.fc39.noarch 157/230 Installing : perl-Git-2.47.0-1.fc39.noarch 158/230 Installing : git-2.47.0-1.fc39.aarch64 159/230 Installing : libb2-0.98.1-9.fc39.aarch64 160/230 Installing : python3-3.12.7-1.fc39.aarch64 161/230 Installing : python3-libs-3.12.7-1.fc39.aarch64 162/230 Installing : cmake-rpm-macros-3.30.5-1.fc39.noarch 163/230 Installing : python3-packaging-23.1-4.fc39.noarch 164/230 Installing : python3-rpm-generators-14-7.fc39.noarch 165/230 Installing : crypto-policies-scripts-20231204-1.git1e3a2e4.fc 166/230 Installing : nss-sysinit-3.105.0-1.fc39.aarch64 167/230 Installing : nss-3.105.0-1.fc39.aarch64 168/230 Running scriptlet: nss-3.105.0-1.fc39.aarch64 168/230 Installing : libXau-1.0.11-3.fc39.aarch64 169/230 Installing : libxcb-1.13.1-12.fc39.aarch64 170/230 Installing : libX11-1.8.9-1.fc39.aarch64 171/230 Installing : libXrender-0.9.11-3.fc39.aarch64 172/230 Installing : libXext-1.3.5-3.fc39.aarch64 173/230 Installing : libXt-1.2.1-5.fc39.aarch64 174/230 Installing : libXpm-3.5.17-1.fc39.aarch64 175/230 Installing : jsoncpp-1.9.5-5.fc39.aarch64 176/230 Installing : jbigkit-libs-2.1-26.fc39.aarch64 177/230 Installing : libtiff-4.4.0-10.fc39.aarch64 178/230 Installing : jbig2dec-libs-0.19-10.fc39.aarch64 179/230 Installing : isl-0.16.1-18.fc39.aarch64 180/230 Installing : graphite2-1.3.14-12.fc39.aarch64 181/230 Installing : cairo-1.18.0-1.fc39.aarch64 182/230 Installing : harfbuzz-8.2.1-2.fc39.aarch64 183/230 Installing : freetype-2.13.1-2.fc39.aarch64 184/230 Installing : fontconfig-2.14.2-6.fc39.aarch64 185/230 Running scriptlet: fontconfig-2.14.2-6.fc39.aarch64 185/230 Installing : cairo-gobject-1.18.0-1.fc39.aarch64 186/230 Installing : gd-2.3.3-12.fc39.aarch64 187/230 Installing : libXft-2.3.8-3.fc39.aarch64 188/230 Installing : poppler-23.08.0-1.fc39.aarch64 189/230 Installing : poppler-glib-23.08.0-1.fc39.aarch64 190/230 Installing : gc-8.2.2-4.fc39.aarch64 191/230 Installing : guile22-2.2.7-9.fc39.aarch64 192/230 Installing : make-1:4.4.1-2.fc39.aarch64 193/230 Installing : gcc-13.3.1-3.fc39.aarch64 194/230 Running scriptlet: gcc-13.3.1-3.fc39.aarch64 194/230 Installing : gcc-c++-13.3.1-3.fc39.aarch64 195/230 Installing : cmake-data-3.30.5-1.fc39.noarch 196/230 Installing : cmake-3.30.5-1.fc39.aarch64 197/230 Installing : fribidi-1.0.13-2.fc39.aarch64 198/230 Installing : pango-1.51.0-1.fc39.aarch64 199/230 Installing : librsvg2-2.57.1-2.fc39.aarch64 200/230 Installing : rsvg-pixbuf-loader-2.57.1-2.fc39.aarch64 201/230 Installing : lasi-1.1.3-11.fc39.aarch64 202/230 Installing : dbus-libs-1:1.14.10-1.fc39.aarch64 203/230 Installing : avahi-libs-0.8-24.fc39.aarch64 204/230 Installing : cups-libs-1:2.4.11-1.fc39.aarch64 205/230 Installing : clang16-resource-filesystem-16.0.6-3.fc39.aarch6 206/230 Installing : clang16-libs-16.0.6-3.fc39.aarch64 207/230 Installing : adobe-mappings-pdf-20190401-5.fc39.noarch 208/230 Installing : libgs-10.02.1-7.fc39.aarch64 209/230 Installing : graphviz-8.1.0-6.fc39.aarch64 210/230 Running scriptlet: graphviz-8.1.0-6.fc39.aarch64 210/230 Installing : libcudnn9-cuda-12-9.5.1.17-1.aarch64 211/230 Installing : cuda-nvvm-12-6-12.6.85-1.aarch64 212/230 Installing : cuda-nvrtc-12-6-12.6.85-1.aarch64 213/230 Running scriptlet: cuda-nvrtc-12-6-12.6.85-1.aarch64 213/230 Installing : cuda-crt-12-6-12.6.85-1.aarch64 214/230 Installing : cuda-cccl-12-6-12.6.77-1.aarch64 215/230 Installing : cuda-cudart-devel-12-6-12.6.77-1.aarch64 216/230 Installing : cuda-nvcc-12-6-12.6.85-1.aarch64 217/230 Installing : cuda-nvrtc-devel-12-6-12.6.85-1.aarch64 218/230 Installing : libcudnn9-devel-cuda-12-9.5.1.17-1.aarch64 219/230 Installing : doxygen-2:1.9.7-3.fc39.aarch64 220/230 Installing : annobin-plugin-gcc-12.60-1.fc39.aarch64 221/230 Running scriptlet: annobin-plugin-gcc-12.60-1.fc39.aarch64 221/230 Installing : gcc-plugin-annobin-13.3.1-3.fc39.aarch64 222/230 Running scriptlet: gcc-plugin-annobin-13.3.1-3.fc39.aarch64 222/230 Installing : cuda-gcc-11-c++-11.2.1-1.fc39.aarch64 223/230 Installing : python3-devel-3.12.7-1.fc39.aarch64 224/230 Installing : python3-setuptools-67.7.2-8.fc39.noarch 225/230 Installing : libcurand-devel-12-6-10.3.7.77-2.aarch64 226/230 Installing : libcublas-devel-12-6-12.6.4.1-1.aarch64 227/230 Installing : cuda-nvtx-12-6-12.6.77-1.aarch64 228/230 Installing : cuda-nvml-devel-12-6-12.6.77-1.aarch64 229/230 Installing : cuda-driver-devel-12-6-12.6.77-1.aarch64 230/230 Running scriptlet: cuda-toolkit-12-6-config-common-12.6.77-1.noarch 230/230 Running scriptlet: urw-base35-bookman-fonts-20200910-20.fc39.noarch 230/230 Running scriptlet: urw-base35-c059-fonts-20200910-20.fc39.noarch 230/230 Running scriptlet: urw-base35-d050000l-fonts-20200910-20.fc39.noarc 230/230 Running scriptlet: urw-base35-gothic-fonts-20200910-20.fc39.noarch 230/230 Running scriptlet: urw-base35-nimbus-mono-ps-fonts-20200910-20.fc39 230/230 Running scriptlet: urw-base35-nimbus-roman-fonts-20200910-20.fc39.n 230/230 Running scriptlet: urw-base35-nimbus-sans-fonts-20200910-20.fc39.no 230/230 Running scriptlet: urw-base35-p052-fonts-20200910-20.fc39.noarch 230/230 Running scriptlet: urw-base35-standard-symbols-ps-fonts-20200910-20 230/230 Running scriptlet: urw-base35-z003-fonts-20200910-20.fc39.noarch 230/230 Running scriptlet: crypto-policies-scripts-20231204-1.git1e3a2e4.fc 230/230 Running scriptlet: nss-3.105.0-1.fc39.aarch64 230/230 Running scriptlet: fontconfig-2.14.2-6.fc39.aarch64 230/230 Running scriptlet: libcudnn9-devel-cuda-12-9.5.1.17-1.aarch64 230/230 Running scriptlet: cuda-driver-devel-12-6-12.6.77-1.aarch64 230/230 Verifying : cuda-gcc-11-11.2.1-1.fc39.aarch64 1/230 Verifying : cuda-gcc-11-c++-11.2.1-1.fc39.aarch64 2/230 Verifying : glibc-devel-2.38-99.fc39.aarch64 3/230 Verifying : libcurand-devel-12-6-10.3.7.77-2.aarch64 4/230 Verifying : cuda-toolkit-12-6-config-common-12.6.77-1.noarch 5/230 Verifying : cuda-toolkit-12-config-common-12.6.77-1.noarch 6/230 Verifying : cuda-toolkit-config-common-12.6.77-1.noarch 7/230 Verifying : cuda-cccl-12-6-12.6.77-1.aarch64 8/230 Verifying : cuda-crt-12-6-12.6.85-1.aarch64 9/230 Verifying : cuda-cudart-12-6-12.6.77-1.aarch64 10/230 Verifying : cuda-cudart-devel-12-6-12.6.77-1.aarch64 11/230 Verifying : cuda-driver-devel-12-6-12.6.77-1.aarch64 12/230 Verifying : cuda-nvcc-12-6-12.6.85-1.aarch64 13/230 Verifying : cuda-nvml-devel-12-6-12.6.77-1.aarch64 14/230 Verifying : cuda-nvrtc-12-6-12.6.85-1.aarch64 15/230 Verifying : cuda-nvrtc-devel-12-6-12.6.85-1.aarch64 16/230 Verifying : cuda-nvtx-12-6-12.6.77-1.aarch64 17/230 Verifying : cuda-nvvm-12-6-12.6.85-1.aarch64 18/230 Verifying : libcublas-12-6-12.6.4.1-1.aarch64 19/230 Verifying : libcublas-devel-12-6-12.6.4.1-1.aarch64 20/230 Verifying : libcudnn9-cuda-12-9.5.1.17-1.aarch64 21/230 Verifying : libcudnn9-devel-cuda-12-9.5.1.17-1.aarch64 22/230 Verifying : libcurand-12-6-10.3.7.77-1.aarch64 23/230 Verifying : abattis-cantarell-vf-fonts-0.301-10.fc39.noarch 24/230 Verifying : adobe-mappings-pdf-20190401-5.fc39.noarch 25/230 Verifying : avahi-libs-0.8-24.fc39.aarch64 26/230 Verifying : cairo-1.18.0-1.fc39.aarch64 27/230 Verifying : cairo-gobject-1.18.0-1.fc39.aarch64 28/230 Verifying : clang16-libs-16.0.6-3.fc39.aarch64 29/230 Verifying : clang16-resource-filesystem-16.0.6-3.fc39.aarch6 30/230 Verifying : dbus-libs-1:1.14.10-1.fc39.aarch64 31/230 Verifying : default-fonts-core-sans-4.0-9.fc39.noarch 32/230 Verifying : doxygen-2:1.9.7-3.fc39.aarch64 33/230 Verifying : fonts-filesystem-1:2.0.5-12.fc39.noarch 34/230 Verifying : freetype-2.13.1-2.fc39.aarch64 35/230 Verifying : fribidi-1.0.13-2.fc39.aarch64 36/230 Verifying : gc-8.2.2-4.fc39.aarch64 37/230 Verifying : gd-2.3.3-12.fc39.aarch64 38/230 Verifying : gdk-pixbuf2-2.42.10-5.fc39.aarch64 39/230 Verifying : google-droid-sans-fonts-20200215-17.fc39.noarch 40/230 Verifying : graphite2-1.3.14-12.fc39.aarch64 41/230 Verifying : gts-0.7.6-46.20121130.fc39.aarch64 42/230 Verifying : guile22-2.2.7-9.fc39.aarch64 43/230 Verifying : harfbuzz-8.2.1-2.fc39.aarch64 44/230 Verifying : isl-0.16.1-18.fc39.aarch64 45/230 Verifying : jbig2dec-libs-0.19-10.fc39.aarch64 46/230 Verifying : jbigkit-libs-2.1-26.fc39.aarch64 47/230 Verifying : jsoncpp-1.9.5-5.fc39.aarch64 48/230 Verifying : lasi-1.1.3-11.fc39.aarch64 49/230 Verifying : lcms2-2.15-2.fc39.aarch64 50/230 Verifying : libICE-1.0.10-11.fc39.aarch64 51/230 Verifying : libSM-1.2.3-13.fc39.aarch64 52/230 Verifying : libXau-1.0.11-3.fc39.aarch64 53/230 Verifying : libXext-1.3.5-3.fc39.aarch64 54/230 Verifying : libXft-2.3.8-3.fc39.aarch64 55/230 Verifying : libXrender-0.9.11-3.fc39.aarch64 56/230 Verifying : libXt-1.2.1-5.fc39.aarch64 57/230 Verifying : libavif-0.11.1-11.fc39.aarch64 58/230 Verifying : libb2-0.98.1-9.fc39.aarch64 59/230 Verifying : libcbor-0.10.2-2.fc39.aarch64 60/230 Verifying : libdatrie-0.2.13-7.fc39.aarch64 61/230 Verifying : libdav1d-1.2.1-2.fc39.aarch64 62/230 Verifying : libfido2-1.13.0-3.fc39.aarch64 63/230 Verifying : libijs-0.35-19.fc39.aarch64 64/230 Verifying : libjpeg-turbo-2.1.4-3.fc39.aarch64 65/230 Verifying : liblerc-4.0.0-4.fc39.aarch64 66/230 Verifying : libmpc-1.3.1-3.fc39.aarch64 67/230 Verifying : libpaper-1:2.1.1-1.fc39.aarch64 68/230 Verifying : libpng-2:1.6.37-15.fc39.aarch64 69/230 Verifying : libthai-0.1.29-6.fc39.aarch64 70/230 Verifying : libwebp-1.3.2-2.fc39.aarch64 71/230 Verifying : libxcb-1.13.1-12.fc39.aarch64 72/230 Verifying : libxcrypt-devel-4.4.36-2.fc39.aarch64 73/230 Verifying : llvm16-libs-16.0.6-5.fc39.aarch64 74/230 Verifying : make-1:4.4.1-2.fc39.aarch64 75/230 Verifying : mpdecimal-2.5.1-7.fc39.aarch64 76/230 Verifying : netpbm-11.02.00-2.fc39.aarch64 77/230 Verifying : nettle-3.9.1-2.fc39.aarch64 78/230 Verifying : pango-1.51.0-1.fc39.aarch64 79/230 Verifying : perl-Carp-1.54-500.fc39.noarch 80/230 Verifying : perl-Data-Dumper-2.188-501.fc39.aarch64 81/230 Verifying : perl-Digest-1.20-500.fc39.noarch 82/230 Verifying : perl-Digest-MD5-2.58-500.fc39.aarch64 83/230 Verifying : perl-Encode-4:3.19-500.fc39.aarch64 84/230 Verifying : perl-Error-1:0.17029-13.fc39.noarch 85/230 Verifying : perl-Exporter-5.77-500.fc39.noarch 86/230 Verifying : perl-File-Path-2.18-500.fc39.noarch 87/230 Verifying : perl-File-Temp-1:0.231.100-500.fc39.noarch 88/230 Verifying : perl-Getopt-Long-1:2.54-500.fc39.noarch 89/230 Verifying : perl-HTTP-Tiny-0.088-3.fc39.noarch 90/230 Verifying : perl-IO-Socket-IP-0.42-1.fc39.noarch 91/230 Verifying : perl-IO-Socket-SSL-2.083-3.fc39.noarch 92/230 Verifying : perl-MIME-Base64-3.16-500.fc39.aarch64 93/230 Verifying : perl-Mozilla-CA-20230801-1.fc39.noarch 94/230 Verifying : perl-Net-SSLeay-1.92-10.fc39.aarch64 95/230 Verifying : perl-PathTools-3.89-500.fc39.aarch64 96/230 Verifying : perl-Pod-Escapes-1:1.07-500.fc39.noarch 97/230 Verifying : perl-Pod-Perldoc-3.28.01-501.fc39.noarch 98/230 Verifying : perl-Pod-Simple-1:3.45-4.fc39.noarch 99/230 Verifying : perl-Pod-Usage-4:2.03-500.fc39.noarch 100/230 Verifying : perl-Scalar-List-Utils-5:1.63-500.fc39.aarch64 101/230 Verifying : perl-Socket-4:2.037-3.fc39.aarch64 102/230 Verifying : perl-Storable-1:3.32-500.fc39.aarch64 103/230 Verifying : perl-Term-ANSIColor-5.01-501.fc39.noarch 104/230 Verifying : perl-Term-Cap-1.18-500.fc39.noarch 105/230 Verifying : perl-TermReadKey-2.38-18.fc39.aarch64 106/230 Verifying : perl-Text-ParseWords-3.31-500.fc39.noarch 107/230 Verifying : perl-Text-Tabs+Wrap-2023.0511-3.fc39.noarch 108/230 Verifying : perl-Time-Local-2:1.350-3.fc39.noarch 109/230 Verifying : perl-URI-5.21-1.fc39.noarch 110/230 Verifying : perl-constant-1.33-501.fc39.noarch 111/230 Verifying : perl-libnet-3.15-501.fc39.noarch 112/230 Verifying : perl-parent-1:0.241-500.fc39.noarch 113/230 Verifying : perl-podlators-1:5.01-500.fc39.noarch 114/230 Verifying : pixman-0.42.2-2.fc39.aarch64 115/230 Verifying : poppler-23.08.0-1.fc39.aarch64 116/230 Verifying : poppler-data-0.4.11-5.fc39.noarch 117/230 Verifying : poppler-glib-23.08.0-1.fc39.aarch64 118/230 Verifying : python3-packaging-23.1-4.fc39.noarch 119/230 Verifying : python3-rpm-generators-14-7.fc39.noarch 120/230 Verifying : rhash-1.4.3-3.fc39.aarch64 121/230 Verifying : shared-mime-info-2.2-4.fc39.aarch64 122/230 Verifying : svt-av1-libs-1.4.1-3.fc39.aarch64 123/230 Verifying : xml-common-0.6.3-61.fc39.noarch 124/230 Verifying : adobe-mappings-cmap-20231115-1.fc39.noarch 125/230 Verifying : adobe-mappings-cmap-deprecated-20231115-1.fc39.n 126/230 Verifying : annobin-docs-12.60-1.fc39.noarch 127/230 Verifying : annobin-plugin-gcc-12.60-1.fc39.aarch64 128/230 Verifying : cmake-3.30.5-1.fc39.aarch64 129/230 Verifying : cmake-data-3.30.5-1.fc39.noarch 130/230 Verifying : cmake-filesystem-3.30.5-1.fc39.aarch64 131/230 Verifying : cmake-rpm-macros-3.30.5-1.fc39.noarch 132/230 Verifying : cpp-13.3.1-3.fc39.aarch64 133/230 Verifying : crypto-policies-scripts-20231204-1.git1e3a2e4.fc 134/230 Verifying : cups-libs-1:2.4.11-1.fc39.aarch64 135/230 Verifying : emacs-filesystem-1:29.4-2.fc39.noarch 136/230 Verifying : expat-2.6.3-1.fc39.aarch64 137/230 Verifying : fontconfig-2.14.2-6.fc39.aarch64 138/230 Verifying : gcc-13.3.1-3.fc39.aarch64 139/230 Verifying : gcc-c++-13.3.1-3.fc39.aarch64 140/230 Verifying : gcc-plugin-annobin-13.3.1-3.fc39.aarch64 141/230 Verifying : git-2.47.0-1.fc39.aarch64 142/230 Verifying : git-core-2.47.0-1.fc39.aarch64 143/230 Verifying : git-core-doc-2.47.0-1.fc39.noarch 144/230 Verifying : glib2-2.78.6-1.fc39.aarch64 145/230 Verifying : gnutls-3.8.6-1.fc39.aarch64 146/230 Verifying : google-noto-fonts-common-20240101-1.fc39.noarch 147/230 Verifying : google-noto-sans-vf-fonts-20240101-1.fc39.noarch 148/230 Verifying : graphviz-8.1.0-6.fc39.aarch64 149/230 Verifying : groff-base-1.23.0-3.fc39.aarch64 150/230 Verifying : highway-1.1.0-1.fc39.aarch64 151/230 Verifying : kernel-headers-6.11.3-100.fc39.aarch64 152/230 Verifying : less-633-4.fc39.aarch64 153/230 Verifying : libX11-1.8.9-1.fc39.aarch64 154/230 Verifying : libX11-common-1.8.9-1.fc39.noarch 155/230 Verifying : libXpm-3.5.17-1.fc39.aarch64 156/230 Verifying : libaom-3.9.0-1.fc39.aarch64 157/230 Verifying : libasan-13.3.1-3.fc39.aarch64 158/230 Verifying : libatomic-13.3.1-3.fc39.aarch64 159/230 Verifying : libedit-3.1-53.20240808cvs.fc39.aarch64 160/230 Verifying : libgs-10.02.1-7.fc39.aarch64 161/230 Verifying : libimagequant-4.0.3-5.fc39.aarch64 162/230 Verifying : libjxl-1:0.8.3-1.fc39.aarch64 163/230 Verifying : librsvg2-2.57.1-2.fc39.aarch64 164/230 Verifying : libstdc++-devel-13.3.1-3.fc39.aarch64 165/230 Verifying : libtiff-4.4.0-10.fc39.aarch64 166/230 Verifying : libubsan-13.3.1-3.fc39.aarch64 167/230 Verifying : libuv-1:1.49.2-1.fc39.aarch64 168/230 Verifying : ncurses-6.4-7.20230520.fc39.1.aarch64 169/230 Verifying : nspr-4.35.0-24.fc39.aarch64 170/230 Verifying : nss-3.105.0-1.fc39.aarch64 171/230 Verifying : nss-softokn-3.105.0-1.fc39.aarch64 172/230 Verifying : nss-softokn-freebl-3.105.0-1.fc39.aarch64 173/230 Verifying : nss-sysinit-3.105.0-1.fc39.aarch64 174/230 Verifying : nss-util-3.105.0-1.fc39.aarch64 175/230 Verifying : openjpeg2-2.5.2-1.fc39.aarch64 176/230 Verifying : openssh-9.3p1-11.fc39.aarch64 177/230 Verifying : openssh-clients-9.3p1-11.fc39.aarch64 178/230 Verifying : perl-AutoLoader-5.74-502.fc39.noarch 179/230 Verifying : perl-B-1.88-502.fc39.aarch64 180/230 Verifying : perl-Class-Struct-0.68-502.fc39.noarch 181/230 Verifying : perl-DynaLoader-1.54-502.fc39.aarch64 182/230 Verifying : perl-Errno-1.37-502.fc39.aarch64 183/230 Verifying : perl-Fcntl-1.15-502.fc39.aarch64 184/230 Verifying : perl-File-Basename-2.86-502.fc39.noarch 185/230 Verifying : perl-File-Find-1.43-502.fc39.noarch 186/230 Verifying : perl-File-stat-1.13-502.fc39.noarch 187/230 Verifying : perl-FileHandle-2.05-502.fc39.noarch 188/230 Verifying : perl-Getopt-Std-1.13-502.fc39.noarch 189/230 Verifying : perl-Git-2.47.0-1.fc39.noarch 190/230 Verifying : perl-IO-1.52-502.fc39.aarch64 191/230 Verifying : perl-IPC-Open3-1.22-502.fc39.noarch 192/230 Verifying : perl-POSIX-2.13-502.fc39.aarch64 193/230 Verifying : perl-SelectSaver-1.02-502.fc39.noarch 194/230 Verifying : perl-Symbol-1.09-502.fc39.noarch 195/230 Verifying : perl-base-2.27-502.fc39.noarch 196/230 Verifying : perl-if-0.61.000-502.fc39.noarch 197/230 Verifying : perl-interpreter-4:5.38.2-502.fc39.aarch64 198/230 Verifying : perl-lib-0.65-502.fc39.aarch64 199/230 Verifying : perl-libs-4:5.38.2-502.fc39.aarch64 200/230 Verifying : perl-locale-1.10-502.fc39.noarch 201/230 Verifying : perl-mro-1.28-502.fc39.aarch64 202/230 Verifying : perl-overload-1.37-502.fc39.noarch 203/230 Verifying : perl-overloading-0.02-502.fc39.noarch 204/230 Verifying : perl-vars-1.05-502.fc39.noarch 205/230 Verifying : pyproject-rpm-macros-1.16.0-1.fc39.noarch 206/230 Verifying : python-pip-wheel-23.2.1-2.fc39.noarch 207/230 Verifying : python-rpm-macros-3.12-8.fc39.noarch 208/230 Verifying : python3-3.12.7-1.fc39.aarch64 209/230 Verifying : python3-devel-3.12.7-1.fc39.aarch64 210/230 Verifying : python3-libs-3.12.7-1.fc39.aarch64 211/230 Verifying : python3-rpm-macros-3.12-8.fc39.noarch 212/230 Verifying : python3-setuptools-67.7.2-8.fc39.noarch 213/230 Verifying : rav1e-libs-0.7.1-4.fc39.aarch64 214/230 Verifying : rsvg-pixbuf-loader-2.57.1-2.fc39.aarch64 215/230 Verifying : tzdata-2024a-2.fc39.noarch 216/230 Verifying : urw-base35-bookman-fonts-20200910-20.fc39.noarch 217/230 Verifying : urw-base35-c059-fonts-20200910-20.fc39.noarch 218/230 Verifying : urw-base35-d050000l-fonts-20200910-20.fc39.noarc 219/230 Verifying : urw-base35-fonts-20200910-20.fc39.noarch 220/230 Verifying : urw-base35-fonts-common-20200910-20.fc39.noarch 221/230 Verifying : urw-base35-gothic-fonts-20200910-20.fc39.noarch 222/230 Verifying : urw-base35-nimbus-mono-ps-fonts-20200910-20.fc39 223/230 Verifying : urw-base35-nimbus-roman-fonts-20200910-20.fc39.n 224/230 Verifying : urw-base35-nimbus-sans-fonts-20200910-20.fc39.no 225/230 Verifying : urw-base35-p052-fonts-20200910-20.fc39.noarch 226/230 Verifying : urw-base35-standard-symbols-ps-fonts-20200910-20 227/230 Verifying : urw-base35-z003-fonts-20200910-20.fc39.noarch 228/230 Verifying : vim-filesystem-2:9.1.825-1.fc39.noarch 229/230 Verifying : xapian-core-libs-1.4.26-1.fc39.aarch64 230/230 Installed: abattis-cantarell-vf-fonts-0.301-10.fc39.noarch adobe-mappings-cmap-20231115-1.fc39.noarch adobe-mappings-cmap-deprecated-20231115-1.fc39.noarch adobe-mappings-pdf-20190401-5.fc39.noarch annobin-docs-12.60-1.fc39.noarch annobin-plugin-gcc-12.60-1.fc39.aarch64 avahi-libs-0.8-24.fc39.aarch64 cairo-1.18.0-1.fc39.aarch64 cairo-gobject-1.18.0-1.fc39.aarch64 clang16-libs-16.0.6-3.fc39.aarch64 clang16-resource-filesystem-16.0.6-3.fc39.aarch64 cmake-3.30.5-1.fc39.aarch64 cmake-data-3.30.5-1.fc39.noarch cmake-filesystem-3.30.5-1.fc39.aarch64 cmake-rpm-macros-3.30.5-1.fc39.noarch cpp-13.3.1-3.fc39.aarch64 crypto-policies-scripts-20231204-1.git1e3a2e4.fc39.noarch cuda-cccl-12-6-12.6.77-1.aarch64 cuda-crt-12-6-12.6.85-1.aarch64 cuda-cudart-12-6-12.6.77-1.aarch64 cuda-cudart-devel-12-6-12.6.77-1.aarch64 cuda-driver-devel-12-6-12.6.77-1.aarch64 cuda-gcc-11-11.2.1-1.fc39.aarch64 cuda-gcc-11-c++-11.2.1-1.fc39.aarch64 cuda-nvcc-12-6-12.6.85-1.aarch64 cuda-nvml-devel-12-6-12.6.77-1.aarch64 cuda-nvrtc-12-6-12.6.85-1.aarch64 cuda-nvrtc-devel-12-6-12.6.85-1.aarch64 cuda-nvtx-12-6-12.6.77-1.aarch64 cuda-nvvm-12-6-12.6.85-1.aarch64 cuda-toolkit-12-6-config-common-12.6.77-1.noarch cuda-toolkit-12-config-common-12.6.77-1.noarch cuda-toolkit-config-common-12.6.77-1.noarch cups-libs-1:2.4.11-1.fc39.aarch64 dbus-libs-1:1.14.10-1.fc39.aarch64 default-fonts-core-sans-4.0-9.fc39.noarch doxygen-2:1.9.7-3.fc39.aarch64 emacs-filesystem-1:29.4-2.fc39.noarch expat-2.6.3-1.fc39.aarch64 fontconfig-2.14.2-6.fc39.aarch64 fonts-filesystem-1:2.0.5-12.fc39.noarch freetype-2.13.1-2.fc39.aarch64 fribidi-1.0.13-2.fc39.aarch64 gc-8.2.2-4.fc39.aarch64 gcc-13.3.1-3.fc39.aarch64 gcc-c++-13.3.1-3.fc39.aarch64 gcc-plugin-annobin-13.3.1-3.fc39.aarch64 gd-2.3.3-12.fc39.aarch64 gdk-pixbuf2-2.42.10-5.fc39.aarch64 git-2.47.0-1.fc39.aarch64 git-core-2.47.0-1.fc39.aarch64 git-core-doc-2.47.0-1.fc39.noarch glib2-2.78.6-1.fc39.aarch64 glibc-devel-2.38-99.fc39.aarch64 gnutls-3.8.6-1.fc39.aarch64 google-droid-sans-fonts-20200215-17.fc39.noarch google-noto-fonts-common-20240101-1.fc39.noarch google-noto-sans-vf-fonts-20240101-1.fc39.noarch graphite2-1.3.14-12.fc39.aarch64 graphviz-8.1.0-6.fc39.aarch64 groff-base-1.23.0-3.fc39.aarch64 gts-0.7.6-46.20121130.fc39.aarch64 guile22-2.2.7-9.fc39.aarch64 harfbuzz-8.2.1-2.fc39.aarch64 highway-1.1.0-1.fc39.aarch64 isl-0.16.1-18.fc39.aarch64 jbig2dec-libs-0.19-10.fc39.aarch64 jbigkit-libs-2.1-26.fc39.aarch64 jsoncpp-1.9.5-5.fc39.aarch64 kernel-headers-6.11.3-100.fc39.aarch64 lasi-1.1.3-11.fc39.aarch64 lcms2-2.15-2.fc39.aarch64 less-633-4.fc39.aarch64 libICE-1.0.10-11.fc39.aarch64 libSM-1.2.3-13.fc39.aarch64 libX11-1.8.9-1.fc39.aarch64 libX11-common-1.8.9-1.fc39.noarch libXau-1.0.11-3.fc39.aarch64 libXext-1.3.5-3.fc39.aarch64 libXft-2.3.8-3.fc39.aarch64 libXpm-3.5.17-1.fc39.aarch64 libXrender-0.9.11-3.fc39.aarch64 libXt-1.2.1-5.fc39.aarch64 libaom-3.9.0-1.fc39.aarch64 libasan-13.3.1-3.fc39.aarch64 libatomic-13.3.1-3.fc39.aarch64 libavif-0.11.1-11.fc39.aarch64 libb2-0.98.1-9.fc39.aarch64 libcbor-0.10.2-2.fc39.aarch64 libcublas-12-6-12.6.4.1-1.aarch64 libcublas-devel-12-6-12.6.4.1-1.aarch64 libcudnn9-cuda-12-9.5.1.17-1.aarch64 libcudnn9-devel-cuda-12-9.5.1.17-1.aarch64 libcurand-12-6-10.3.7.77-1.aarch64 libcurand-devel-12-6-10.3.7.77-2.aarch64 libdatrie-0.2.13-7.fc39.aarch64 libdav1d-1.2.1-2.fc39.aarch64 libedit-3.1-53.20240808cvs.fc39.aarch64 libfido2-1.13.0-3.fc39.aarch64 libgs-10.02.1-7.fc39.aarch64 libijs-0.35-19.fc39.aarch64 libimagequant-4.0.3-5.fc39.aarch64 libjpeg-turbo-2.1.4-3.fc39.aarch64 libjxl-1:0.8.3-1.fc39.aarch64 liblerc-4.0.0-4.fc39.aarch64 libmpc-1.3.1-3.fc39.aarch64 libpaper-1:2.1.1-1.fc39.aarch64 libpng-2:1.6.37-15.fc39.aarch64 librsvg2-2.57.1-2.fc39.aarch64 libstdc++-devel-13.3.1-3.fc39.aarch64 libthai-0.1.29-6.fc39.aarch64 libtiff-4.4.0-10.fc39.aarch64 libubsan-13.3.1-3.fc39.aarch64 libuv-1:1.49.2-1.fc39.aarch64 libwebp-1.3.2-2.fc39.aarch64 libxcb-1.13.1-12.fc39.aarch64 libxcrypt-devel-4.4.36-2.fc39.aarch64 llvm16-libs-16.0.6-5.fc39.aarch64 make-1:4.4.1-2.fc39.aarch64 mpdecimal-2.5.1-7.fc39.aarch64 ncurses-6.4-7.20230520.fc39.1.aarch64 netpbm-11.02.00-2.fc39.aarch64 nettle-3.9.1-2.fc39.aarch64 nspr-4.35.0-24.fc39.aarch64 nss-3.105.0-1.fc39.aarch64 nss-softokn-3.105.0-1.fc39.aarch64 nss-softokn-freebl-3.105.0-1.fc39.aarch64 nss-sysinit-3.105.0-1.fc39.aarch64 nss-util-3.105.0-1.fc39.aarch64 openjpeg2-2.5.2-1.fc39.aarch64 openssh-9.3p1-11.fc39.aarch64 openssh-clients-9.3p1-11.fc39.aarch64 pango-1.51.0-1.fc39.aarch64 perl-AutoLoader-5.74-502.fc39.noarch perl-B-1.88-502.fc39.aarch64 perl-Carp-1.54-500.fc39.noarch perl-Class-Struct-0.68-502.fc39.noarch perl-Data-Dumper-2.188-501.fc39.aarch64 perl-Digest-1.20-500.fc39.noarch perl-Digest-MD5-2.58-500.fc39.aarch64 perl-DynaLoader-1.54-502.fc39.aarch64 perl-Encode-4:3.19-500.fc39.aarch64 perl-Errno-1.37-502.fc39.aarch64 perl-Error-1:0.17029-13.fc39.noarch perl-Exporter-5.77-500.fc39.noarch perl-Fcntl-1.15-502.fc39.aarch64 perl-File-Basename-2.86-502.fc39.noarch perl-File-Find-1.43-502.fc39.noarch perl-File-Path-2.18-500.fc39.noarch perl-File-Temp-1:0.231.100-500.fc39.noarch perl-File-stat-1.13-502.fc39.noarch perl-FileHandle-2.05-502.fc39.noarch perl-Getopt-Long-1:2.54-500.fc39.noarch perl-Getopt-Std-1.13-502.fc39.noarch perl-Git-2.47.0-1.fc39.noarch perl-HTTP-Tiny-0.088-3.fc39.noarch perl-IO-1.52-502.fc39.aarch64 perl-IO-Socket-IP-0.42-1.fc39.noarch perl-IO-Socket-SSL-2.083-3.fc39.noarch perl-IPC-Open3-1.22-502.fc39.noarch perl-MIME-Base64-3.16-500.fc39.aarch64 perl-Mozilla-CA-20230801-1.fc39.noarch perl-Net-SSLeay-1.92-10.fc39.aarch64 perl-POSIX-2.13-502.fc39.aarch64 perl-PathTools-3.89-500.fc39.aarch64 perl-Pod-Escapes-1:1.07-500.fc39.noarch perl-Pod-Perldoc-3.28.01-501.fc39.noarch perl-Pod-Simple-1:3.45-4.fc39.noarch perl-Pod-Usage-4:2.03-500.fc39.noarch perl-Scalar-List-Utils-5:1.63-500.fc39.aarch64 perl-SelectSaver-1.02-502.fc39.noarch perl-Socket-4:2.037-3.fc39.aarch64 perl-Storable-1:3.32-500.fc39.aarch64 perl-Symbol-1.09-502.fc39.noarch perl-Term-ANSIColor-5.01-501.fc39.noarch perl-Term-Cap-1.18-500.fc39.noarch perl-TermReadKey-2.38-18.fc39.aarch64 perl-Text-ParseWords-3.31-500.fc39.noarch perl-Text-Tabs+Wrap-2023.0511-3.fc39.noarch perl-Time-Local-2:1.350-3.fc39.noarch perl-URI-5.21-1.fc39.noarch perl-base-2.27-502.fc39.noarch perl-constant-1.33-501.fc39.noarch perl-if-0.61.000-502.fc39.noarch perl-interpreter-4:5.38.2-502.fc39.aarch64 perl-lib-0.65-502.fc39.aarch64 perl-libnet-3.15-501.fc39.noarch perl-libs-4:5.38.2-502.fc39.aarch64 perl-locale-1.10-502.fc39.noarch perl-mro-1.28-502.fc39.aarch64 perl-overload-1.37-502.fc39.noarch perl-overloading-0.02-502.fc39.noarch perl-parent-1:0.241-500.fc39.noarch perl-podlators-1:5.01-500.fc39.noarch perl-vars-1.05-502.fc39.noarch pixman-0.42.2-2.fc39.aarch64 poppler-23.08.0-1.fc39.aarch64 poppler-data-0.4.11-5.fc39.noarch poppler-glib-23.08.0-1.fc39.aarch64 pyproject-rpm-macros-1.16.0-1.fc39.noarch python-pip-wheel-23.2.1-2.fc39.noarch python-rpm-macros-3.12-8.fc39.noarch python3-3.12.7-1.fc39.aarch64 python3-devel-3.12.7-1.fc39.aarch64 python3-libs-3.12.7-1.fc39.aarch64 python3-packaging-23.1-4.fc39.noarch python3-rpm-generators-14-7.fc39.noarch python3-rpm-macros-3.12-8.fc39.noarch python3-setuptools-67.7.2-8.fc39.noarch rav1e-libs-0.7.1-4.fc39.aarch64 rhash-1.4.3-3.fc39.aarch64 rsvg-pixbuf-loader-2.57.1-2.fc39.aarch64 shared-mime-info-2.2-4.fc39.aarch64 svt-av1-libs-1.4.1-3.fc39.aarch64 tzdata-2024a-2.fc39.noarch urw-base35-bookman-fonts-20200910-20.fc39.noarch urw-base35-c059-fonts-20200910-20.fc39.noarch urw-base35-d050000l-fonts-20200910-20.fc39.noarch urw-base35-fonts-20200910-20.fc39.noarch urw-base35-fonts-common-20200910-20.fc39.noarch urw-base35-gothic-fonts-20200910-20.fc39.noarch urw-base35-nimbus-mono-ps-fonts-20200910-20.fc39.noarch urw-base35-nimbus-roman-fonts-20200910-20.fc39.noarch urw-base35-nimbus-sans-fonts-20200910-20.fc39.noarch urw-base35-p052-fonts-20200910-20.fc39.noarch urw-base35-standard-symbols-ps-fonts-20200910-20.fc39.noarch urw-base35-z003-fonts-20200910-20.fc39.noarch vim-filesystem-2:9.1.825-1.fc39.noarch xapian-core-libs-1.4.26-1.fc39.aarch64 xml-common-0.6.3-61.fc39.noarch Complete! Finish: build setup for cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.src.rpm Start: rpmbuild cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.src.rpm sh: -c: line 1: unexpected EOF while looking for matching `"' Building target platforms: aarch64 Building for target aarch64 setting SOURCE_DATE_EPOCH=1636416000 Executing(%prep): /bin/sh -e /var/tmp/rpm-tmp.T9ka7o + umask 022 + cd /builddir/build/BUILD + cd /builddir/build/BUILD + rm -rf cutlass + /usr/bin/mkdir -p cutlass + cd cutlass + rm -rf /builddir/build/BUILD/cutlass-SPECPARTS + /usr/bin/mkdir -p /builddir/build/BUILD/cutlass-SPECPARTS + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w . + git clone --depth 1 -n -b main https://github.com/NVIDIA/cutlass.git . Cloning into '.'... + git fetch --depth 1 origin cc3c29a81a140f7b97045718fb88eb0664c37bd7 From https://github.com/NVIDIA/cutlass * branch cc3c29a81a140f7b97045718fb88eb0664c37bd7 -> FETCH_HEAD + git reset --hard cc3c29a81a140f7b97045718fb88eb0664c37bd7 HEAD is now at cc3c29a CUTLASS 3.6.0 (#1850) + git --no-pager log --format=fuller commit cc3c29a81a140f7b97045718fb88eb0664c37bd7 Author: Yujia Zhai AuthorDate: Wed Oct 9 12:33:27 2024 -0700 Commit: GitHub CommitDate: Wed Oct 9 15:33:27 2024 -0400 CUTLASS 3.6.0 (#1850) * v3.6 * update changelog * update readme * fix typo * fixing typos * hopper gemm with weight prefetch --------- Co-authored-by: yuzhai Co-authored-by: Haicheng Wu Patch #0 (cutlass-fp16.patch): + echo 'Patch #0 (cutlass-fp16.patch):' + /usr/bin/patch --no-backup-if-mismatch -f -p0 -b --suffix .fp16~ --fuzz=100 patching file include/cutlass/functional.h Hunk #1 succeeded at 221 with fuzz 3 (offset 132 lines). + sed -i /-rpath/d CMakeLists.txt + RPM_EC=0 ++ jobs -p + exit 0 Executing(%build): /bin/sh -e /var/tmp/rpm-tmp.LuWkAb + umask 022 + cd /builddir/build/BUILD + CFLAGS=' ' + export CFLAGS + CXXFLAGS=' ' + export CXXFLAGS + FFLAGS=' -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS=' -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=gcc + export CC + CXX=g++ + export CXX + cd cutlass + mkdir -p build + pushd build ~/build/BUILD/cutlass/build ~/build/BUILD/cutlass + export LD_LIBRARY_PATH=/usr/local/cuda-12.6/lib64/ + LD_LIBRARY_PATH=/usr/local/cuda-12.6/lib64/ + CFLAGS=' ' + export CFLAGS + CXXFLAGS=' ' + export CXXFLAGS + FFLAGS=' -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS=' -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=gcc + export CC + CXX=g++ + export CXX + /usr/bin/cmake -DCMAKE_C_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_CXX_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_Fortran_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_VERBOSE_MAKEFILE:BOOL=ON -DCMAKE_INSTALL_DO_STRIP:BOOL=OFF -DCMAKE_INSTALL_PREFIX:PATH=/usr -DINCLUDE_INSTALL_DIR:PATH=/usr/include -DLIB_INSTALL_DIR:PATH=/usr/lib64 -DSYSCONF_INSTALL_DIR:PATH=/etc -DSHARE_INSTALL_PREFIX:PATH=/usr/share -DLIB_SUFFIX=64 -DBUILD_SHARED_LIBS:BOOL=ON .. -DCMAKE_SKIP_RPATH=ON -DCMAKE_VERBOSE_MAKEFILE=OFF -DCMAKE_BUILD_TYPE=Release -DCMAKE_EXE_LINKER_FLAGS=/usr/lib64/libstdc++.so.6 -DBUILD_TESTING=OFF -DCUTLASS_ENABLE_TESTS=OFF -DCUTLASS_ENABLE_PROFILER=ON -DCUTLASS_ENABLE_EXAMPLES=OFF -DCUDA_PROPAGATE_HOST_FLAGS=OFF -DCMAKE_CUDA_HOST_COMPILER=/usr/bin/cuda-c++ -DCUTLASS_NVCC_EMBED_PTX=ON -DCUTLASS_NVCC_EMBED_CUBIN=ON '-DCUTLASS_NVCC_ARCHS=52;61;75;86;89;90' '-DCMAKE_CUDA_FLAGS=-Wl,--no-relax -Xfatbin=-compress-all --compiler-options -fPIC -Wno-deprecated-gpu-targets -allow-unsupported-compiler -D_SERIALIZE_H_INCLUDED' -DCMAKE_CUDA_COMPILER=/usr/local/cuda-12.6/bin/nvcc -- CMake Version: 3.30.5 -- CUTLASS 3.6.0 -- The CXX compiler identification is GNU 13.3.1 -- Detecting CXX compiler ABI info -- Detecting CXX compiler ABI info - done -- Check for working CXX compiler: /usr/bin/g++ - skipped -- Detecting CXX compile features -- Detecting CXX compile features - done -- The CUDA compiler identification is NVIDIA 12.6.85 -- Detecting CUDA compiler ABI info -- Detecting CUDA compiler ABI info - done -- Check for working CUDA compiler: /usr/local/cuda-12.6/bin/nvcc - skipped -- Detecting CUDA compile features -- Detecting CUDA compile features - done -- CUDART: /usr/local/cuda-12.6/lib64/libcudart.so -- CUDA Driver: /usr/local/cuda-12.6/lib64/stubs/libcuda.so -- NVRTC: /usr/local/cuda-12.6/lib64/libnvrtc.so -- Default Install Location: /usr -- Found Python3: /usr/bin/python3.12 (found suitable version "3.12.7", minimum required is "3.5") found components: Interpreter -- Make cute::tuple be the new standard-layout tuple type CMake Warning at CMakeLists.txt:166 (message): Using unsupported or deprecated compute capabilities 52;61. Support may be removed in future versions. -- CUDA Compilation Architectures: 52;61;75;86;89;90 -- Enable caching of reference results in conv unit tests -- Enable rigorous conv problem sizes in conv unit tests -- Using NVCC flags: --expt-relaxed-constexpr;-DCUTE_USE_PACKED_TUPLE=1;-DCUTLASS_TEST_LEVEL=0;-DCUTLASS_TEST_ENABLE_CACHED_RESULTS=1;-DCUTLASS_CONV_UNIT_TEST_RIGOROUS_SIZE_ENABLED=1;-DCUTLASS_DEBUG_TRACE_LEVEL=0;-Xcompiler=-Wconversion;-Xcompiler=-fno-strict-aliasing -- CUTLASS Revision: cc3c29a -- Configuring cublas ... -- cuBLAS Disabled. -- Configuring cuBLAS ... done. -- Completed generation of library instances. See /builddir/build/BUILD/cutlass/build/tools/library/library_instance_generation.log for more information. -- Configuring done (7.2s) -- Generating done (2.9s) CMake Warning: Manually-specified variables were not used by the project: CMAKE_C_FLAGS_RELEASE CMAKE_Fortran_FLAGS_RELEASE CMAKE_INSTALL_DO_STRIP CUDA_PROPAGATE_HOST_FLAGS INCLUDE_INSTALL_DIR LIB_INSTALL_DIR LIB_SUFFIX SHARE_INSTALL_PREFIX SYSCONF_INSTALL_DIR -- Build files have been written to: /builddir/build/BUILD/cutlass/build + make -j4 [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm50_cgemm_objs.dir/generated/gemm/50/cgemm/all_sm50_cgemm_gemm_operations.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_z1684symm_objs.dir/generated/symm/90/z1684symm/all_sm90_z1684symm_symm_operations.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm50_dgemm_objs.dir/generated/gemm/50/dgemm/all_sm50_dgemm_gemm_operations.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/handle.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm50_cgemm_objs.dir/generated/gemm/50/cgemm/cutlass_simt_cgemm_128x64_8x2_nn_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_z1684symm_objs.dir/generated/symm/90/z1684symm/cutlass_tensorop_z1684symm_128x64x8_1x1x1_3_n_ls_l_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm50_dgemm_objs.dir/generated/gemm/50/dgemm/cutlass_simt_dgemm_128x128_8x2_nn_align1.cu.o [ 0%] Building CXX object tools/library/CMakeFiles/cutlass_library_objs.dir/src/manifest.cpp.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/operation_table.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/singleton.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_z1684symm_objs.dir/generated/symm/90/z1684symm/cutlass_tensorop_z1684symm_128x64x8_1x1x1_3_n_ls_u_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/util.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm50_dgemm_objs.dir/generated/gemm/50/dgemm/cutlass_simt_dgemm_128x128_8x2_nt_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_int4.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm50_cgemm_objs.dir/generated/gemm/50/cgemm/cutlass_simt_cgemm_128x64_8x2_nt_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_z1684symm_objs.dir/generated/symm/90/z1684symm/cutlass_tensorop_z1684symm_128x64x8_1x1x1_3_n_rs_l_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm50_dgemm_objs.dir/generated/gemm/50/dgemm/cutlass_simt_dgemm_128x128_8x2_tn_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_z1684symm_objs.dir/generated/symm/90/z1684symm/cutlass_tensorop_z1684symm_128x64x8_1x1x1_3_n_rs_u_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm50_cgemm_objs.dir/generated/gemm/50/cgemm/cutlass_simt_cgemm_128x64_8x2_tn_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm50_dgemm_objs.dir/generated/gemm/50/dgemm/cutlass_simt_dgemm_128x128_8x2_tt_align1.cu.o [ 0%] Built target cutlass_library_symm_sm90_z1684symm_objs [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm50_cgemm_objs.dir/generated/gemm/50/cgemm/cutlass_simt_cgemm_128x64_8x2_tt_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_s8_s8_s32.cu.o [ 0%] Built target cutlass_library_gemm_sm50_dgemm_objs [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_u8_u8_s32.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm50_sgemm_objs.dir/generated/gemm/50/sgemm/all_sm50_sgemm_gemm_operations.cu.o [ 0%] Built target cutlass_library_gemm_sm50_cgemm_objs [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm50_sgemm_objs.dir/generated/gemm/50/sgemm/cutlass_simt_sgemm_128x128_8x2_nn_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_int8_interleaved_32.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm50_sgemm_objs.dir/generated/gemm/50/sgemm/cutlass_simt_sgemm_128x128_8x2_nt_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm60_hgemm_objs.dir/generated/gemm/60/hgemm/all_sm60_hgemm_gemm_operations.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm50_sgemm_objs.dir/generated/gemm/50/sgemm/cutlass_simt_sgemm_128x128_8x2_tn_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm60_hgemm_objs.dir/generated/gemm/60/hgemm/cutlass_simt_hgemm_256x128_8x2_nn_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm50_sgemm_objs.dir/generated/gemm/50/sgemm/cutlass_simt_sgemm_128x128_8x2_tt_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm60_hgemm_objs.dir/generated/gemm/60/hgemm/cutlass_simt_hgemm_256x128_8x2_nt_align1.cu.o [ 0%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm60_hgemm_objs.dir/generated/gemm/60/hgemm/cutlass_simt_hgemm_256x128_8x2_tn_align1.cu.o [ 0%] Built target cutlass_library_gemm_sm50_sgemm_objs [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm60_hgemm_objs.dir/generated/gemm/60/hgemm/cutlass_simt_hgemm_256x128_8x2_tt_align1.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm61_igemm_s8_objs.dir/generated/gemm/61/igemm_s8/all_sm61_igemm_s8_gemm_operations.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm61_igemm_s8_objs.dir/generated/gemm/61/igemm_s8/cutlass_simt_igemm_s8_128x128_32x2_nn_align1.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm61_igemm_s8_objs.dir/generated/gemm/61/igemm_s8/cutlass_simt_igemm_s8_128x128_32x2_nt_align1.cu.o [ 1%] Built target cutlass_library_gemm_sm60_hgemm_objs [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm61_igemm_s8_objs.dir/generated/gemm/61/igemm_s8/cutlass_simt_igemm_s8_128x128_32x2_tn_align1.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_int8_interleaved_64.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm61_s8_igemm_s8_objs.dir/generated/gemm/61/s8_igemm_s8/all_sm61_s8_igemm_s8_gemm_operations.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm61_igemm_s8_objs.dir/generated/gemm/61/igemm_s8/cutlass_simt_igemm_s8_128x128_32x2_tt_align1.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm61_s8_igemm_s8_objs.dir/generated/gemm/61/s8_igemm_s8/cutlass_simt_s8_igemm_s8_128x128_32x2_nn_align1.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm61_s8_igemm_s8_objs.dir/generated/gemm/61/s8_igemm_s8/cutlass_simt_s8_igemm_s8_128x128_32x2_nt_align1.cu.o [ 1%] Built target cutlass_library_gemm_sm61_igemm_s8_objs [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_e4m3a_e4m3out.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_f16_objs.dir/generated/gemm/70/f16_s884gemm_f16/all_sm70_f16_s884gemm_f16_gemm_operations.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm61_s8_igemm_s8_objs.dir/generated/gemm/61/s8_igemm_s8/cutlass_simt_s8_igemm_s8_128x128_32x2_tn_align1.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_f16_objs.dir/generated/gemm/70/f16_s884gemm_f16/cutlass_tensorop_f16_s884gemm_f16_256x128_32x2_nn_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm61_s8_igemm_s8_objs.dir/generated/gemm/61/s8_igemm_s8/cutlass_simt_s8_igemm_s8_128x128_32x2_tt_align1.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_f16_objs.dir/generated/gemm/70/f16_s884gemm_f16/cutlass_tensorop_f16_s884gemm_f16_256x128_32x2_nt_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_f16_objs.dir/generated/gemm/70/f16_s884gemm_f16/cutlass_tensorop_f16_s884gemm_f16_256x128_32x2_tn_align8.cu.o [ 1%] Built target cutlass_library_gemm_sm61_s8_igemm_s8_objs [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_e5m2a_e4m3out.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_e4m3a_e5m2out.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_f16_objs.dir/generated/gemm/70/f16_s884gemm_f16/cutlass_tensorop_f16_s884gemm_f16_256x128_32x2_tt_align8.cu.o [ 1%] Built target cutlass_library_gemm_sm70_f16_s884gemm_f16_objs [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_e5m2a_e5m2out.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/all_sm70_f16_s884gemm_planar_complex_array_f16_gemm_operations.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_nn_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/all_sm70_f16_s884gemm_planar_complex_f16_gemm_operations.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_nn_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_cn_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_objs.dir/generated/gemm/70/h884gemm/all_sm70_h884gemm_gemm_operations.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_objs.dir/generated/gemm/70/h884gemm/cutlass_tensorop_h884gemm_256x128_32x2_nn_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_cn_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_nc_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_objs.dir/generated/gemm/70/h884gemm/cutlass_tensorop_h884gemm_256x128_32x2_nt_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_nc_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_cc_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_objs.dir/generated/gemm/70/h884gemm/cutlass_tensorop_h884gemm_256x128_32x2_tn_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_cc_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_nt_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_fp8in_fp16out.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_objs.dir/generated/gemm/70/h884gemm/cutlass_tensorop_h884gemm_256x128_32x2_tt_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_nt_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_ct_align8.cu.o [ 1%] Built target cutlass_library_gemm_sm70_h884gemm_objs [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_nh_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_ct_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_fp8in_bf16out.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_ch_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_nh_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_tn_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_ch_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_hn_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_tn_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_tc_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_hn_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_hc_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_tc_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_tt_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_hc_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_ht_align8.cu.o [ 1%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_tt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_ht_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_th_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_th_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_array_f16/cutlass_tensorop_f16_s884gemm_planar_complex_array_f16_64x64_32x2_hh_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/f16_s884gemm_planar_complex_f16/cutlass_tensorop_f16_s884gemm_planar_complex_f16_64x64_32x2_hh_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/all_sm70_h884gemm_planar_complex_gemm_operations.cu.o [ 2%] Built target cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_objs [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_nn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_fp8in_fp32out.cu.o [ 2%] Built target cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_objs [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_cn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_fp32out.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_nc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/all_sm70_h884gemm_planar_complex_array_gemm_operations.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_nn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_cc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_cn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_nt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_nc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_ct_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_nh_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_cc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_ch_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_nt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_tn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_ct_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_hn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_nh_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_tc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_ch_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_hc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_f16_objs.dir/generated/gemm/70/s884gemm_f16/all_sm70_s884gemm_f16_gemm_operations.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_fp_other.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_tn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_f16_objs.dir/generated/gemm/70/s884gemm_f16/cutlass_tensorop_s884gemm_f16_256x128_32x2_nn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_tt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_f16_objs.dir/generated/gemm/70/s884gemm_f16/cutlass_tensorop_s884gemm_f16_256x128_32x2_nt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_hn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_ht_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_f16_objs.dir/generated/gemm/70/s884gemm_f16/cutlass_tensorop_s884gemm_f16_256x128_32x2_tn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_tc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_th_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_f16_objs.dir/generated/gemm/70/s884gemm_f16/cutlass_tensorop_s884gemm_f16_256x128_32x2_tt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_objs.dir/generated/gemm/70/h884gemm_planar_complex/cutlass_tensorop_h884gemm_planar_complex_64x64_32x2_hh_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_hc_align8.cu.o [ 2%] Built target cutlass_library_gemm_sm70_s884gemm_f16_objs [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_fp_mixed_input.cu.o [ 2%] Built target cutlass_library_gemm_sm70_h884gemm_planar_complex_objs [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_tt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/gemm_int_mixed_input.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_ht_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_th_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs.dir/generated/gemm/70/h884gemm_planar_complex_array/cutlass_tensorop_h884gemm_planar_complex_array_64x64_32x2_hh_align8.cu.o [ 2%] Built target cutlass_library_gemm_sm70_h884gemm_planar_complex_array_objs [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/initialize_reference_operations.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/all_sm70_s884gemm_planar_complex_array_f16_gemm_operations.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_nn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_cn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/all_sm70_s884gemm_planar_complex_f16_gemm_operations.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_nn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_nc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_cn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_cc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_nc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_nt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_cc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_ct_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_nt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_nh_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_ct_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_f16_objs.dir/generated/gemm/75/f16_s1688gemm_f16/all_sm75_f16_s1688gemm_f16_gemm_operations.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_ch_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_f16_objs.dir/generated/gemm/75/f16_s1688gemm_f16/cutlass_tensorop_f16_s1688gemm_f16_256x128_32x2_nn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_nh_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_tn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_f16_objs.dir/generated/gemm/75/f16_s1688gemm_f16/cutlass_tensorop_f16_s1688gemm_f16_256x128_32x2_nt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_ch_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_hn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_f16_objs.dir/generated/gemm/75/f16_s1688gemm_f16/cutlass_tensorop_f16_s1688gemm_f16_256x128_32x2_tn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_tn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_tc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_f16_objs.dir/generated/gemm/75/f16_s1688gemm_f16/cutlass_tensorop_f16_s1688gemm_f16_256x128_32x2_tt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_hn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_hc_align8.cu.o [ 2%] Built target cutlass_library_gemm_sm75_f16_s1688gemm_f16_objs [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reduction/reduction_device.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_tc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_tt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_hc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/all_sm75_f16_s1688gemm_planar_complex_array_f16_gemm_operations.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_ht_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_nn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_tt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_th_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_cn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_ht_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_array_f16/cutlass_tensorop_s884gemm_planar_complex_array_f16_64x64_32x2_hh_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_th_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_nc_align8.cu.o [ 2%] Built target cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_objs [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs.dir/generated/gemm/70/s884gemm_planar_complex_f16/cutlass_tensorop_s884gemm_planar_complex_f16_64x64_32x2_hh_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reduction/init_reduction_operations.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_cc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/all_sm75_f16_s1688gemm_planar_complex_f16_gemm_operations.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_nn_align8.cu.o [ 2%] Built target cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_objs [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_nt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/conv2d.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_cn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_ct_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_nc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_nh_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_objs.dir/generated/gemm/75/h1688gemm/all_sm75_h1688gemm_gemm_operations.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_cc_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_objs.dir/generated/gemm/75/h1688gemm/cutlass_tensorop_h1688gemm_256x128_32x2_nn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_ch_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_nt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_objs.dir/generated/gemm/75/h1688gemm/cutlass_tensorop_h1688gemm_256x128_32x2_nt_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_tn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_ct_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_objs.dir/generated/gemm/75/h1688gemm/cutlass_tensorop_h1688gemm_256x128_32x2_tn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_hn_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_nh_align8.cu.o [ 2%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_objs.dir/generated/gemm/75/h1688gemm/cutlass_tensorop_h1688gemm_256x128_32x2_tt_align8.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_tc_align8.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/src/reference/conv3d.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_ch_align8.cu.o [ 3%] Built target cutlass_library_gemm_sm75_h1688gemm_objs [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_hc_align8.cu.o [ 3%] Building CXX object tools/library/CMakeFiles/cutlass_library_objs.dir/generated/initialize_all.cpp.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/all_sm75_h1688gemm_planar_complex_gemm_operations.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_nn_align8.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_tn_align8.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_tt_align8.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/generated/gemm/all_gemm_operations.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_cn_align8.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_hn_align8.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_ht_align8.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/generated/conv2d/all_conv2d_operations.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/generated/conv3d/all_conv3d_operations.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/generated/rank_k/all_rank_k_operations.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_tc_align8.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_nc_align8.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/generated/rank_2k/all_rank_2k_operations.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_th_align8.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/generated/trmm/all_trmm_operations.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_hc_align8.cu.o [ 3%] Building CUDA object tools/library/CMakeFiles/cutlass_library_objs.dir/generated/symm/all_symm_operations.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_cc_align8.cu.o [ 4%] Built target cutlass_library_objs [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_array_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_array_f16_64x128_32x2_hh_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_tt_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_nt_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/all_sm75_h1688gemm_planar_complex_array_gemm_operations.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_nn_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_ht_align8.cu.o [ 4%] Built target cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_objs [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_cn_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_ct_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_th_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_nh_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_nc_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_i88128xorgemm_b1_objs.dir/generated/gemm/75/i88128xorgemm_b1/all_sm75_i88128xorgemm_b1_gemm_operations.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_i88128xorgemm_b1_objs.dir/generated/gemm/75/i88128xorgemm_b1/cutlass_tensorop_i88128xorgemm_b1_256x128_512x2_tn_align128.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/f16_s1688gemm_planar_complex_f16/cutlass_tensorop_f16_s1688gemm_planar_complex_f16_64x128_32x2_hh_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_ch_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_cc_align8.cu.o [ 4%] Built target cutlass_library_gemm_sm75_i88128xorgemm_b1_objs [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_tn_align8.cu.o [ 4%] Built target cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_objs [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_nt_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_ct_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_i8816gemm_s8_objs.dir/generated/gemm/75/i8816gemm_s8/all_sm75_i8816gemm_s8_gemm_operations.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_i8816gemm_s8_objs.dir/generated/gemm/75/i8816gemm_s8/cutlass_tensorop_i8816gemm_s8_256x128_64x2_tn_align16.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_hn_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_tc_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_nh_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_i8816gemm_u8_objs.dir/generated/gemm/75/i8816gemm_u8/all_sm75_i8816gemm_u8_gemm_operations.cu.o [ 4%] Built target cutlass_library_gemm_sm75_i8816gemm_s8_objs [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_ch_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_hc_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_i8816gemm_u8_objs.dir/generated/gemm/75/i8816gemm_u8/cutlass_tensorop_i8816gemm_u8_256x128_64x2_tn_align16.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_tt_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_tn_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_hn_align8.cu.o [ 4%] Built target cutlass_library_gemm_sm75_i8816gemm_u8_objs [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_tc_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_ht_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_th_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_i8832gemm_s4_objs.dir/generated/gemm/75/i8832gemm_s4/all_sm75_i8832gemm_s4_gemm_operations.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_hc_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_i8832gemm_u4_objs.dir/generated/gemm/75/i8832gemm_u4/all_sm75_i8832gemm_u4_gemm_operations.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_i8832gemm_s4_objs.dir/generated/gemm/75/i8832gemm_s4/cutlass_tensorop_i8832gemm_s4_256x128_128x2_tn_align32.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_i8832gemm_u4_objs.dir/generated/gemm/75/i8832gemm_u4/cutlass_tensorop_i8832gemm_u4_256x128_128x2_tn_align32.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs.dir/generated/gemm/75/h1688gemm_planar_complex/cutlass_tensorop_h1688gemm_planar_complex_64x128_32x2_hh_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_tt_align8.cu.o [ 4%] Built target cutlass_library_gemm_sm75_i8832gemm_s4_objs [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_f16_objs.dir/generated/gemm/75/s1688gemm_f16/all_sm75_s1688gemm_f16_gemm_operations.cu.o [ 4%] Built target cutlass_library_gemm_sm75_i8832gemm_u4_objs [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_ht_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_f16_objs.dir/generated/gemm/75/s1688gemm_f16/cutlass_tensorop_s1688gemm_f16_256x128_32x2_nn_align8.cu.o [ 4%] Built target cutlass_library_gemm_sm75_h1688gemm_planar_complex_objs [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_th_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_f16_objs.dir/generated/gemm/75/s1688gemm_f16/cutlass_tensorop_s1688gemm_f16_256x128_32x2_nt_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs.dir/generated/gemm/75/h1688gemm_planar_complex_array/cutlass_tensorop_h1688gemm_planar_complex_array_64x128_32x2_hh_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/all_sm75_s1688gemm_planar_complex_array_f16_gemm_operations.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_nn_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_cn_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_f16_objs.dir/generated/gemm/75/s1688gemm_f16/cutlass_tensorop_s1688gemm_f16_256x128_32x2_tn_align8.cu.o [ 4%] Built target cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_objs [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_nc_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_f16_objs.dir/generated/gemm/75/s1688gemm_f16/cutlass_tensorop_s1688gemm_f16_256x128_32x2_tt_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/all_sm75_s1688gemm_planar_complex_f16_gemm_operations.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s4_i8832gemm_s4_objs.dir/generated/gemm/75/s4_i8832gemm_s4/all_sm75_s4_i8832gemm_s4_gemm_operations.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_nn_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s4_i8832gemm_s4_objs.dir/generated/gemm/75/s4_i8832gemm_s4/cutlass_tensorop_s4_i8832gemm_s4_256x128_128x2_tn_align32.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_cc_align8.cu.o [ 4%] Built target cutlass_library_gemm_sm75_s1688gemm_f16_objs [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_cn_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s4_i8832gemm_s4_objs.dir/generated/gemm/75/s4_i8832gemm_s4/cutlass_tensorop_s4_i8832gemm_s4_256x128_128x2_n64t64_align32.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_nt_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s8_i8816gemm_s8_objs.dir/generated/gemm/75/s8_i8816gemm_s8/all_sm75_s8_i8816gemm_s8_gemm_operations.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_nc_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s8_i8816gemm_s8_objs.dir/generated/gemm/75/s8_i8816gemm_s8/cutlass_tensorop_s8_i8816gemm_s8_256x128_64x2_tn_align16.cu.o [ 4%] Built target cutlass_library_gemm_sm75_s4_i8832gemm_s4_objs [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_ct_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_u4_i8832gemm_u4_objs.dir/generated/gemm/75/u4_i8832gemm_u4/all_sm75_u4_i8832gemm_u4_gemm_operations.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_cc_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_u4_i8832gemm_u4_objs.dir/generated/gemm/75/u4_i8832gemm_u4/cutlass_tensorop_u4_i8832gemm_u4_256x128_128x2_tn_align32.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s8_i8816gemm_s8_objs.dir/generated/gemm/75/s8_i8816gemm_s8/cutlass_tensorop_s8_i8816gemm_s8_256x128_64x2_n32t32_align16.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_nh_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_nt_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_u4_i8832gemm_u4_objs.dir/generated/gemm/75/u4_i8832gemm_u4/cutlass_tensorop_u4_i8832gemm_u4_256x128_128x2_n64t64_align32.cu.o [ 4%] Built target cutlass_library_gemm_sm75_s8_i8816gemm_s8_objs [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_ct_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_ch_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_u8_i8816gemm_u8_objs.dir/generated/gemm/75/u8_i8816gemm_u8/all_sm75_u8_i8816gemm_u8_gemm_operations.cu.o [ 4%] Built target cutlass_library_gemm_sm75_u4_i8832gemm_u4_objs [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_tn_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_nh_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_u8_i8816gemm_u8_objs.dir/generated/gemm/75/u8_i8816gemm_u8/cutlass_tensorop_u8_i8816gemm_u8_256x128_64x2_tn_align16.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_ch_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_hn_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_bf16/all_sm80_bf16_s16816gemm_bf16_gemm_operations.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_u8_i8816gemm_u8_objs.dir/generated/gemm/75/u8_i8816gemm_u8/cutlass_tensorop_u8_i8816gemm_u8_256x128_64x2_n32t32_align16.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_tn_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_bf16/cutlass_tensorop_bf16_s16816gemm_bf16_256x128_32x3_nn_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_tc_align8.cu.o [ 4%] Built target cutlass_library_gemm_sm75_u8_i8816gemm_u8_objs [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_hn_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_bf16/cutlass_tensorop_bf16_s16816gemm_bf16_256x128_32x3_nt_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_hc_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_s8_objs.dir/generated/gemm/80/bf16_s16816gemm_bf16_s8/all_sm80_bf16_s16816gemm_bf16_s8_gemm_operations.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_tc_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_s8_objs.dir/generated/gemm/80/bf16_s16816gemm_bf16_s8/cutlass_tensorop_bf16_s16816gemm_bf16_s8_128x128_64x4_tn_align16.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_bf16/cutlass_tensorop_bf16_s16816gemm_bf16_256x128_32x3_tn_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_tt_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_hc_align8.cu.o [ 4%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_bf16/cutlass_tensorop_bf16_s16816gemm_bf16_256x128_32x3_tt_align8.cu.o [ 4%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_s8_objs [ 5%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_ht_align8.cu.o [ 5%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_u8_objs.dir/generated/gemm/80/bf16_s16816gemm_bf16_u8/all_sm80_bf16_s16816gemm_bf16_u8_gemm_operations.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_u8_objs.dir/generated/gemm/80/bf16_s16816gemm_bf16_u8/cutlass_tensorop_bf16_s16816gemm_bf16_u8_128x128_64x4_tn_align16.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_tt_align8.cu.o [ 6%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_objs [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_th_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/all_sm80_bf16_s16816gemm_planar_complex_array_bf16_gemm_operations.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_nn_align8.cu.o [ 6%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_u8_objs [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_array_f16/cutlass_tensorop_s1688gemm_planar_complex_array_f16_64x128_32x2_hh_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_ht_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_cn_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/all_sm80_bf16_s16816gemm_planar_complex_bf16_gemm_operations.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_th_align8.cu.o [ 6%] Built target cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_objs [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs.dir/generated/gemm/75/s1688gemm_planar_complex_f16/cutlass_tensorop_s1688gemm_planar_complex_f16_64x128_32x2_hh_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_nn_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_nc_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_cn_align8.cu.o [ 6%] Built target cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_objs [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_cc_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_s8_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_s8_bf16/all_sm80_bf16_s16816gemm_s8_bf16_gemm_operations.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_s8_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_s8_bf16/cutlass_tensorop_bf16_s16816gemm_s8_bf16_128x128_64x4_tn_align16.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_nc_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_nt_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_u8_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_u8_bf16/all_sm80_bf16_s16816gemm_u8_bf16_gemm_operations.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_u8_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_u8_bf16/cutlass_tensorop_bf16_s16816gemm_u8_bf16_128x128_64x4_tn_align16.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_cc_align8.cu.o [ 6%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_s8_bf16_objs [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_nt_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_ct_align8.cu.o [ 6%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_u8_bf16_objs [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_ct_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_nh_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16832spgemm_bf16_objs.dir/generated/gemm/80/bf16_s16832spgemm_bf16/all_sm80_bf16_s16832spgemm_bf16_gemm_operations.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16832spgemm_bf16_objs.dir/generated/gemm/80/bf16_s16832spgemm_bf16/cutlass_tensorop_bf16_s16832spgemm_bf16_64x128_64x6_nn_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_nh_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16832spgemm_bf16_objs.dir/generated/gemm/80/bf16_s16832spgemm_bf16/cutlass_tensorop_bf16_s16832spgemm_bf16_64x128_64x6_nt_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_ch_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_ch_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_tn_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16832spgemm_bf16_objs.dir/generated/gemm/80/bf16_s16832spgemm_bf16/cutlass_tensorop_bf16_s16832spgemm_bf16_64x128_64x6_tn_align8.cu.o [ 6%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/all_sm80_c1688gemm_gemm_operations.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_nn_align1.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_tn_align8.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_hn_align8.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16832spgemm_bf16_objs.dir/generated/gemm/80/bf16_s16832spgemm_bf16/cutlass_tensorop_bf16_s16832spgemm_bf16_64x128_64x6_tt_align8.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_hn_align8.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_cn_align1.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_tc_align8.cu.o [ 7%] Built target cutlass_library_gemm_sm80_bf16_s16832spgemm_bf16_objs [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_tc_align8.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_nc_align1.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_hc_align8.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_tt_align8.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_hc_align8.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_cc_align1.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_nt_align1.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_ht_align8.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_tt_align8.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/all_sm80_c1688tf32gemm_gemm_operations.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_ct_align1.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_th_align8.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_nn_align1.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_ht_align8.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_array_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_array_bf16_64x128_32x3_hh_align8.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_nh_align1.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_cn_align1.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_th_align8.cu.o [ 7%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_objs [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_ch_align1.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_nc_align1.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/bf16_s16816gemm_planar_complex_bf16/cutlass_tensorop_bf16_s16816gemm_planar_complex_bf16_64x128_32x3_hh_align8.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/all_sm80_cgemm_gemm_operations.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_nn_align1.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_tn_align1.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_cc_align1.cu.o [ 7%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_objs [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_hn_align1.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_cn_align1.cu.o [ 7%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_nt_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_d884gemm_objs.dir/generated/gemm/80/d884gemm/all_sm80_d884gemm_gemm_operations.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_tc_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_d884gemm_objs.dir/generated/gemm/80/d884gemm/cutlass_tensorop_d884gemm_128x128_16x3_nn_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_ct_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_nc_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_hc_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_d884gemm_objs.dir/generated/gemm/80/d884gemm/cutlass_tensorop_d884gemm_128x128_16x3_nt_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_nh_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_cc_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_d884gemm_objs.dir/generated/gemm/80/d884gemm/cutlass_tensorop_d884gemm_128x128_16x3_tn_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_tt_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_ch_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_d884gemm_objs.dir/generated/gemm/80/d884gemm/cutlass_tensorop_d884gemm_128x128_16x3_tt_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_ht_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_nt_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_tn_align1.cu.o [ 8%] Built target cutlass_library_gemm_sm80_d884gemm_objs [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_hn_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_th_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_ct_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688gemm_objs.dir/generated/gemm/80/c1688gemm/cutlass_tensorop_c1688gemm_128x64_16x3_hh_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_tc_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_hc_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_nh_align1.cu.o [ 8%] Built target cutlass_library_gemm_sm80_c1688gemm_objs [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_tt_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_ch_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_dgemm_objs.dir/generated/gemm/80/dgemm/all_sm80_dgemm_gemm_operations.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_dgemm_objs.dir/generated/gemm/80/dgemm/cutlass_simt_dgemm_128x128_8x3_nn_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_dgemm_objs.dir/generated/gemm/80/dgemm/cutlass_simt_dgemm_128x128_8x3_nt_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_ht_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_tn_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_hn_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_dgemm_objs.dir/generated/gemm/80/dgemm/cutlass_simt_dgemm_128x128_8x3_tn_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_th_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_f16_objs.dir/generated/gemm/80/f16_s16816gemm_f16/all_sm80_f16_s16816gemm_f16_gemm_operations.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_f16_objs.dir/generated/gemm/80/f16_s16816gemm_f16/cutlass_tensorop_f16_s16816gemm_f16_256x128_32x3_nn_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_tc_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_dgemm_objs.dir/generated/gemm/80/dgemm/cutlass_simt_dgemm_128x128_8x3_tt_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_c1688tf32gemm_objs.dir/generated/gemm/80/c1688tf32gemm/cutlass_tensorop_c1688tf32gemm_128x128_16x4_hh_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_f16_objs.dir/generated/gemm/80/f16_s16816gemm_f16/cutlass_tensorop_f16_s16816gemm_f16_256x128_32x3_nt_align8.cu.o [ 8%] Built target cutlass_library_gemm_sm80_dgemm_objs [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_hc_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_f16_objs.dir/generated/gemm/80/f16_s16816gemm_f16/cutlass_tensorop_f16_s16816gemm_f16_256x128_32x3_tn_align8.cu.o [ 8%] Built target cutlass_library_gemm_sm80_c1688tf32gemm_objs [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_f16_objs.dir/generated/gemm/80/f16_s16816gemm_f16/cutlass_tensorop_f16_s16816gemm_f16_256x128_32x3_tt_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_tt_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_ht_align1.cu.o [ 8%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_f16_objs [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_th_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_f16_s8_objs.dir/generated/gemm/80/f16_s16816gemm_f16_s8/all_sm80_f16_s16816gemm_f16_s8_gemm_operations.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_f16_s8_objs.dir/generated/gemm/80/f16_s16816gemm_f16_s8/cutlass_tensorop_f16_s16816gemm_f16_s8_128x128_64x4_tn_align16.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_f16_u8_objs.dir/generated/gemm/80/f16_s16816gemm_f16_u8/all_sm80_f16_s16816gemm_f16_u8_gemm_operations.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_f16_u8_objs.dir/generated/gemm/80/f16_s16816gemm_f16_u8/cutlass_tensorop_f16_s16816gemm_f16_u8_128x128_64x4_tn_align16.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/all_sm80_f16_s16816gemm_planar_complex_array_f16_gemm_operations.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_cgemm_objs.dir/generated/gemm/80/cgemm/cutlass_simt_cgemm_128x128_8x5_hh_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_nn_align8.cu.o [ 8%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_f16_s8_objs [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_cn_align8.cu.o [ 8%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_f16_u8_objs [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_nc_align8.cu.o [ 8%] Built target cutlass_library_gemm_sm80_cgemm_objs [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_cc_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/all_sm80_f16_s16816gemm_planar_complex_f16_gemm_operations.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_s8_f16_objs.dir/generated/gemm/80/f16_s16816gemm_s8_f16/all_sm80_f16_s16816gemm_s8_f16_gemm_operations.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_nn_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_s8_f16_objs.dir/generated/gemm/80/f16_s16816gemm_s8_f16/cutlass_tensorop_f16_s16816gemm_s8_f16_128x128_64x4_tn_align16.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_cn_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_nt_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_u8_f16_objs.dir/generated/gemm/80/f16_s16816gemm_u8_f16/all_sm80_f16_s16816gemm_u8_f16_gemm_operations.cu.o [ 8%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_s8_f16_objs [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_nc_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_u8_f16_objs.dir/generated/gemm/80/f16_s16816gemm_u8_f16/cutlass_tensorop_f16_s16816gemm_u8_f16_128x128_64x4_tn_align16.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_ct_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_cc_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16832spgemm_f16_objs.dir/generated/gemm/80/f16_s16832spgemm_f16/all_sm80_f16_s16832spgemm_f16_gemm_operations.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_nh_align8.cu.o [ 8%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_u8_f16_objs [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_ch_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16832spgemm_f16_objs.dir/generated/gemm/80/f16_s16832spgemm_f16/cutlass_tensorop_f16_s16832spgemm_f16_64x128_64x6_nn_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_nt_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16832spgemm_f16_objs.dir/generated/gemm/80/f16_s16832spgemm_f16/cutlass_tensorop_f16_s16832spgemm_f16_64x128_64x6_nt_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_tn_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/all_sm80_gz884gemm_gemm_operations.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_ct_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_nn_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16832spgemm_f16_objs.dir/generated/gemm/80/f16_s16832spgemm_f16/cutlass_tensorop_f16_s16832spgemm_f16_64x128_64x6_tn_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_hn_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_nh_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_cn_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_tc_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16832spgemm_f16_objs.dir/generated/gemm/80/f16_s16832spgemm_f16/cutlass_tensorop_f16_s16832spgemm_f16_64x128_64x6_tt_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_ch_align8.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_nc_align1.cu.o [ 8%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_hc_align8.cu.o [ 8%] Built target cutlass_library_gemm_sm80_f16_s16832spgemm_f16_objs [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_tn_align8.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_cc_align1.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_tt_align8.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_objs.dir/generated/gemm/80/h16816gemm/all_sm80_h16816gemm_gemm_operations.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_nt_align1.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_hn_align8.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_ht_align8.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_objs.dir/generated/gemm/80/h16816gemm/cutlass_tensorop_h16816gemm_256x128_32x3_nn_align8.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_ct_align1.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_tc_align8.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_th_align8.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_objs.dir/generated/gemm/80/h16816gemm/cutlass_tensorop_h16816gemm_256x128_32x3_nt_align8.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_nh_align1.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_hc_align8.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_array_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_array_f16_64x128_32x3_hh_align8.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_objs.dir/generated/gemm/80/h16816gemm/cutlass_tensorop_h16816gemm_256x128_32x3_tn_align8.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_ch_align1.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_tt_align8.cu.o [ 9%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_objs [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_tn_align1.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_objs.dir/generated/gemm/80/h16816gemm/cutlass_tensorop_h16816gemm_256x128_32x3_tt_align8.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_ht_align8.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_f16_s8_objs.dir/generated/gemm/80/h16816gemm_f16_s8/all_sm80_h16816gemm_f16_s8_gemm_operations.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_hn_align1.cu.o [ 9%] Built target cutlass_library_gemm_sm80_h16816gemm_objs [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_tc_align1.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_f16_s8_objs.dir/generated/gemm/80/h16816gemm_f16_s8/cutlass_tensorop_h16816gemm_f16_s8_128x128_64x4_tn_align16.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_th_align8.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_f16_u8_objs.dir/generated/gemm/80/h16816gemm_f16_u8/all_sm80_h16816gemm_f16_u8_gemm_operations.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_hc_align1.cu.o [ 9%] Built target cutlass_library_gemm_sm80_h16816gemm_f16_s8_objs [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/f16_s16816gemm_planar_complex_f16/cutlass_tensorop_f16_s16816gemm_planar_complex_f16_64x128_32x3_hh_align8.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_f16_u8_objs.dir/generated/gemm/80/h16816gemm_f16_u8/cutlass_tensorop_h16816gemm_f16_u8_128x128_64x4_tn_align16.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_grouped_objs.dir/generated/gemm/80/h16816gemm_grouped/all_sm80_h16816gemm_grouped_gemm_operations.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_grouped_objs.dir/generated/gemm/80/h16816gemm_grouped/cutlass_tensorop_h16816gemm_grouped_256x128_32x3_nn_align8_scheduleDevice.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_tt_align1.cu.o [ 9%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_objs [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_grouped_objs.dir/generated/gemm/80/h16816gemm_grouped/cutlass_tensorop_h16816gemm_grouped_256x128_32x3_nt_align8_scheduleDevice.cu.o [ 9%] Built target cutlass_library_gemm_sm80_h16816gemm_f16_u8_objs [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_ht_align1.cu.o [ 9%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/all_sm80_h16816gemm_planar_complex_gemm_operations.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_grouped_objs.dir/generated/gemm/80/h16816gemm_grouped/cutlass_tensorop_h16816gemm_grouped_256x128_32x3_tn_align8_scheduleDevice.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_nn_align8.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/all_sm80_h16816gemm_planar_complex_array_gemm_operations.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_th_align1.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_nn_align8.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_grouped_objs.dir/generated/gemm/80/h16816gemm_grouped/cutlass_tensorop_h16816gemm_grouped_256x128_32x3_tt_align8_scheduleDevice.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_cn_align8.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_gz884gemm_objs.dir/generated/gemm/80/gz884gemm/cutlass_tensorop_gz884gemm_64x64_8x3_hh_align1.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_cn_align8.cu.o [ 10%] Built target cutlass_library_gemm_sm80_h16816gemm_grouped_objs [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_nc_align8.cu.o [ 10%] Built target cutlass_library_gemm_sm80_gz884gemm_objs [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_nc_align8.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_cc_align8.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_s8_f16_objs.dir/generated/gemm/80/h16816gemm_s8_f16/all_sm80_h16816gemm_s8_f16_gemm_operations.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_cc_align8.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_s8_f16_objs.dir/generated/gemm/80/h16816gemm_s8_f16/cutlass_tensorop_h16816gemm_s8_f16_128x128_64x4_tn_align16.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_u8_f16_objs.dir/generated/gemm/80/h16816gemm_u8_f16/all_sm80_h16816gemm_u8_f16_gemm_operations.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_nt_align8.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_u8_f16_objs.dir/generated/gemm/80/h16816gemm_u8_f16/cutlass_tensorop_h16816gemm_u8_f16_128x128_64x4_tn_align16.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_nt_align8.cu.o [ 10%] Built target cutlass_library_gemm_sm80_h16816gemm_s8_f16_objs [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_ct_align8.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_ct_align8.cu.o [ 10%] Built target cutlass_library_gemm_sm80_h16816gemm_u8_f16_objs [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_nh_align8.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_nh_align8.cu.o [ 10%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16832spgemm_objs.dir/generated/gemm/80/h16832spgemm/all_sm80_h16832spgemm_gemm_operations.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16832spgemm_objs.dir/generated/gemm/80/h16832spgemm/cutlass_tensorop_h16832spgemm_64x128_64x6_nn_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_ch_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_ch_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_tn_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16832spgemm_objs.dir/generated/gemm/80/h16832spgemm/cutlass_tensorop_h16832spgemm_64x128_64x6_nt_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_tn_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_hn_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_hn_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16832spgemm_objs.dir/generated/gemm/80/h16832spgemm/cutlass_tensorop_h16832spgemm_64x128_64x6_tn_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i168128spgemm_s4_objs.dir/generated/gemm/80/i168128spgemm_s4/all_sm80_i168128spgemm_s4_gemm_operations.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_tc_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i168128spgemm_s4_objs.dir/generated/gemm/80/i168128spgemm_s4/cutlass_tensorop_i168128spgemm_s4_64x64_256x4_tn_align32.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_tc_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16832spgemm_objs.dir/generated/gemm/80/h16832spgemm/cutlass_tensorop_h16832spgemm_64x128_64x6_tt_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_hc_align8.cu.o ptxas , line 3; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas , line 3; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures [ 11%] Built target cutlass_library_gemm_sm80_i168128spgemm_s4_objs [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_hc_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_tt_align8.cu.o [ 11%] Built target cutlass_library_gemm_sm80_h16832spgemm_objs [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_tt_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i168256andgemm_b1_objs.dir/generated/gemm/80/i168256andgemm_b1/all_sm80_i168256andgemm_b1_gemm_operations.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i168256xorgemm_b1_objs.dir/generated/gemm/80/i168256xorgemm_b1/all_sm80_i168256xorgemm_b1_gemm_operations.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_ht_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i168256andgemm_b1_objs.dir/generated/gemm/80/i168256andgemm_b1/cutlass_tensorop_i168256andgemm_b1_256x128_512x3_tn_align128.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i168256xorgemm_b1_objs.dir/generated/gemm/80/i168256xorgemm_b1/cutlass_tensorop_i168256xorgemm_b1_256x128_512x3_tn_align128.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_ht_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_th_align8.cu.o [ 11%] Built target cutlass_library_gemm_sm80_i168256andgemm_b1_objs [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_th_align8.cu.o [ 11%] Built target cutlass_library_gemm_sm80_i168256xorgemm_b1_objs [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs.dir/generated/gemm/80/h16816gemm_planar_complex_array/cutlass_tensorop_h16816gemm_planar_complex_array_64x128_32x3_hh_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i16832gemm_s4_s8_objs.dir/generated/gemm/80/i16832gemm_s4_s8/all_sm80_i16832gemm_s4_s8_gemm_operations.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i16832gemm_s4_s8_objs.dir/generated/gemm/80/i16832gemm_s4_s8/cutlass_tensorop_i16832gemm_s4_s8_256x128_64x3_tn_align32.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs.dir/generated/gemm/80/h16816gemm_planar_complex/cutlass_tensorop_h16816gemm_planar_complex_64x128_32x3_hh_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i16832gemm_s8_objs.dir/generated/gemm/80/i16832gemm_s8/all_sm80_i16832gemm_s8_gemm_operations.cu.o [ 11%] Built target cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_objs [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i16832gemm_s8_objs.dir/generated/gemm/80/i16832gemm_s8/cutlass_tensorop_i16832gemm_s8_256x128_64x3_tn_align16.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i16832gemm_s8_s4_objs.dir/generated/gemm/80/i16832gemm_s8_s4/all_sm80_i16832gemm_s8_s4_gemm_operations.cu.o [ 11%] Built target cutlass_library_gemm_sm80_i16832gemm_s4_s8_objs [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i16832gemm_s8_s4_objs.dir/generated/gemm/80/i16832gemm_s8_s4/cutlass_tensorop_i16832gemm_s8_s4_256x128_64x3_tn_align32.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i16832gemm_u8_objs.dir/generated/gemm/80/i16832gemm_u8/all_sm80_i16832gemm_u8_gemm_operations.cu.o [ 11%] Built target cutlass_library_gemm_sm80_h16816gemm_planar_complex_objs [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i16832gemm_u8_objs.dir/generated/gemm/80/i16832gemm_u8/cutlass_tensorop_i16832gemm_u8_256x128_64x3_tn_align16.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i16864gemm_s4_objs.dir/generated/gemm/80/i16864gemm_s4/all_sm80_i16864gemm_s4_gemm_operations.cu.o [ 11%] Built target cutlass_library_gemm_sm80_i16832gemm_s8_objs [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i16864gemm_s4_objs.dir/generated/gemm/80/i16864gemm_s4/cutlass_tensorop_i16864gemm_s4_256x128_128x3_tn_align32.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i16864gemm_u4_objs.dir/generated/gemm/80/i16864gemm_u4/all_sm80_i16864gemm_u4_gemm_operations.cu.o [ 11%] Built target cutlass_library_gemm_sm80_i16832gemm_s8_s4_objs [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i16864gemm_u4_objs.dir/generated/gemm/80/i16864gemm_u4/cutlass_tensorop_i16864gemm_u4_256x128_128x3_tn_align32.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i16864spgemm_s8_objs.dir/generated/gemm/80/i16864spgemm_s8/all_sm80_i16864spgemm_s8_gemm_operations.cu.o [ 11%] Built target cutlass_library_gemm_sm80_i16832gemm_u8_objs [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_i16864spgemm_s8_objs.dir/generated/gemm/80/i16864spgemm_s8/cutlass_tensorop_i16864spgemm_s8_128x64_128x3_tn_align16.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_bf16_objs.dir/generated/gemm/80/s16816gemm_bf16/all_sm80_s16816gemm_bf16_gemm_operations.cu.o [ 11%] Built target cutlass_library_gemm_sm80_i16864gemm_s4_objs [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_bf16_objs.dir/generated/gemm/80/s16816gemm_bf16/cutlass_tensorop_s16816gemm_bf16_256x128_32x3_nn_align8.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_bf16_s8_objs.dir/generated/gemm/80/s16816gemm_bf16_s8/all_sm80_s16816gemm_bf16_s8_gemm_operations.cu.o [ 11%] Built target cutlass_library_gemm_sm80_i16864gemm_u4_objs [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_bf16_s8_objs.dir/generated/gemm/80/s16816gemm_bf16_s8/cutlass_tensorop_s16816gemm_bf16_s8_128x128_64x4_tn_align16.cu.o [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_bf16_objs.dir/generated/gemm/80/s16816gemm_bf16/cutlass_tensorop_s16816gemm_bf16_256x128_32x3_nt_align8.cu.o [ 11%] Built target cutlass_library_gemm_sm80_i16864spgemm_s8_objs [ 11%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_bf16_objs.dir/generated/gemm/80/s16816gemm_bf16/cutlass_tensorop_s16816gemm_bf16_256x128_32x3_tn_align8.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_bf16_u8_objs.dir/generated/gemm/80/s16816gemm_bf16_u8/all_sm80_s16816gemm_bf16_u8_gemm_operations.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_bf16_u8_objs.dir/generated/gemm/80/s16816gemm_bf16_u8/cutlass_tensorop_s16816gemm_bf16_u8_128x128_64x4_tn_align16.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_f16_objs.dir/generated/gemm/80/s16816gemm_f16/all_sm80_s16816gemm_f16_gemm_operations.cu.o [ 12%] Built target cutlass_library_gemm_sm80_s16816gemm_bf16_s8_objs [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_bf16_objs.dir/generated/gemm/80/s16816gemm_bf16/cutlass_tensorop_s16816gemm_bf16_256x128_32x3_tt_align8.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_f16_objs.dir/generated/gemm/80/s16816gemm_f16/cutlass_tensorop_s16816gemm_f16_256x128_32x3_nn_align8.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_f16_s8_objs.dir/generated/gemm/80/s16816gemm_f16_s8/all_sm80_s16816gemm_f16_s8_gemm_operations.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_f16_s8_objs.dir/generated/gemm/80/s16816gemm_f16_s8/cutlass_tensorop_s16816gemm_f16_s8_128x128_64x4_tn_align16.cu.o [ 12%] Built target cutlass_library_gemm_sm80_s16816gemm_bf16_u8_objs [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_f16_objs.dir/generated/gemm/80/s16816gemm_f16/cutlass_tensorop_s16816gemm_f16_256x128_32x3_nt_align8.cu.o [ 12%] Built target cutlass_library_gemm_sm80_s16816gemm_bf16_objs [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_f16_objs.dir/generated/gemm/80/s16816gemm_f16/cutlass_tensorop_s16816gemm_f16_256x128_32x3_tn_align8.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_f16_u8_objs.dir/generated/gemm/80/s16816gemm_f16_u8/all_sm80_s16816gemm_f16_u8_gemm_operations.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_f16_u8_objs.dir/generated/gemm/80/s16816gemm_f16_u8/cutlass_tensorop_s16816gemm_f16_u8_128x128_64x4_tn_align16.cu.o [ 12%] Built target cutlass_library_gemm_sm80_s16816gemm_f16_s8_objs [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_f16_objs.dir/generated/gemm/80/s16816gemm_f16/cutlass_tensorop_s16816gemm_f16_256x128_32x3_tt_align8.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_grouped_bf16_objs.dir/generated/gemm/80/s16816gemm_grouped_bf16/all_sm80_s16816gemm_grouped_bf16_gemm_operations.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_grouped_f16_objs.dir/generated/gemm/80/s16816gemm_grouped_f16/all_sm80_s16816gemm_grouped_f16_gemm_operations.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_grouped_bf16_objs.dir/generated/gemm/80/s16816gemm_grouped_bf16/cutlass_tensorop_s16816gemm_grouped_bf16_256x128_32x3_nn_align8_scheduleDevice.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_grouped_f16_objs.dir/generated/gemm/80/s16816gemm_grouped_f16/cutlass_tensorop_s16816gemm_grouped_f16_256x128_32x3_nn_align8_scheduleDevice.cu.o [ 12%] Built target cutlass_library_gemm_sm80_s16816gemm_f16_u8_objs [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_grouped_bf16_objs.dir/generated/gemm/80/s16816gemm_grouped_bf16/cutlass_tensorop_s16816gemm_grouped_bf16_256x128_32x3_nt_align8_scheduleDevice.cu.o [ 12%] Built target cutlass_library_gemm_sm80_s16816gemm_f16_objs [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_grouped_f16_objs.dir/generated/gemm/80/s16816gemm_grouped_f16/cutlass_tensorop_s16816gemm_grouped_f16_256x128_32x3_nt_align8_scheduleDevice.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_grouped_f16_objs.dir/generated/gemm/80/s16816gemm_grouped_f16/cutlass_tensorop_s16816gemm_grouped_f16_256x128_32x3_tn_align8_scheduleDevice.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/all_sm80_s16816gemm_planar_complex_array_bf16_gemm_operations.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_nn_align8.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_grouped_bf16_objs.dir/generated/gemm/80/s16816gemm_grouped_bf16/cutlass_tensorop_s16816gemm_grouped_bf16_256x128_32x3_tn_align8_scheduleDevice.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/all_sm80_s16816gemm_planar_complex_array_f16_gemm_operations.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_nn_align8.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_grouped_f16_objs.dir/generated/gemm/80/s16816gemm_grouped_f16/cutlass_tensorop_s16816gemm_grouped_f16_256x128_32x3_tt_align8_scheduleDevice.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_cn_align8.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_grouped_bf16_objs.dir/generated/gemm/80/s16816gemm_grouped_bf16/cutlass_tensorop_s16816gemm_grouped_bf16_256x128_32x3_tt_align8_scheduleDevice.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_cn_align8.cu.o [ 12%] Built target cutlass_library_gemm_sm80_s16816gemm_grouped_f16_objs [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_nc_align8.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_nc_align8.cu.o [ 12%] Built target cutlass_library_gemm_sm80_s16816gemm_grouped_bf16_objs [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_cc_align8.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_nt_align8.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_s8/all_sm90_void_i64x128x64spgemm_s8_gemm_operations.cu.o [ 12%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_cc_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/all_sm80_s16816gemm_planar_complex_bf16_gemm_operations.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_nn_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_ct_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_nt_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_cn_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_nh_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_ct_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_nc_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_ch_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_nh_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_cc_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_tn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_ch_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_nt_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_hn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_tn_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_ct_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_tc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_hn_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_nh_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_hc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_tc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_ch_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_tt_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_hc_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_tn_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_ht_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_tt_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_hn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_th_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_ht_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_tc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_bf16/cutlass_tensorop_s16816gemm_planar_complex_array_bf16_64x128_32x3_hh_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_th_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_hc_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 13%] Built target cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_objs [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_array_f16/cutlass_tensorop_s16816gemm_planar_complex_array_f16_64x128_32x3_hh_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_tt_align8.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Built target cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_objs [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_ht_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_th_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/all_sm80_s16816gemm_planar_complex_f16_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_nn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_bf16/cutlass_tensorop_s16816gemm_planar_complex_bf16_64x128_32x3_hh_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_s8_bf16_objs.dir/generated/gemm/80/s16816gemm_s8_bf16/all_sm80_s16816gemm_s8_bf16_gemm_operations.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_cn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_s8_bf16_objs.dir/generated/gemm/80/s16816gemm_s8_bf16/cutlass_tensorop_s16816gemm_s8_bf16_128x128_64x4_tn_align16.cu.o [ 13%] Built target cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_objs [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_nc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Built target cutlass_library_gemm_sm80_s16816gemm_s8_bf16_objs [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu.o [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_cc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_nt_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 13%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_ct_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " Remark: The warnings can be suppressed with "-diag-suppress " [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_s8_f16_objs.dir/generated/gemm/80/s16816gemm_s8_f16/all_sm80_s16816gemm_s8_f16_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_s8_f16_objs.dir/generated/gemm/80/s16816gemm_s8_f16/cutlass_tensorop_s16816gemm_s8_f16_128x128_64x4_tn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_u8_bf16_objs.dir/generated/gemm/80/s16816gemm_u8_bf16/all_sm80_s16816gemm_u8_bf16_gemm_operations.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_nh_align8.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_u8_bf16_objs.dir/generated/gemm/80/s16816gemm_u8_bf16/cutlass_tensorop_s16816gemm_u8_bf16_128x128_64x4_tn_align16.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 14%] Built target cutlass_library_gemm_sm80_s16816gemm_s8_f16_objs [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_ch_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 14%] Built target cutlass_library_gemm_sm80_s16816gemm_u8_bf16_objs [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_tn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_u8_f16_objs.dir/generated/gemm/80/s16816gemm_u8_f16/all_sm80_s16816gemm_u8_f16_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_u8_f16_objs.dir/generated/gemm/80/s16816gemm_u8_f16/cutlass_tensorop_s16816gemm_u8_f16_128x128_64x4_tn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_hn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_tc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816tf32spgemm_objs.dir/generated/gemm/80/s16816tf32spgemm/all_sm80_s16816tf32spgemm_gemm_operations.cu.o [ 14%] Built target cutlass_library_gemm_sm80_s16816gemm_u8_f16_objs [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816tf32spgemm_objs.dir/generated/gemm/80/s16816tf32spgemm/cutlass_tensorop_s16816tf32spgemm_128x64_32x3_nn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16832spgemm_bf16_objs.dir/generated/gemm/80/s16832spgemm_bf16/all_sm80_s16832spgemm_bf16_gemm_operations.cu.o [ 14%] Built target cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8_objs [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16832spgemm_bf16_objs.dir/generated/gemm/80/s16832spgemm_bf16/cutlass_tensorop_s16832spgemm_bf16_64x128_64x6_nn_align8.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_hc_align8.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816tf32spgemm_objs.dir/generated/gemm/80/s16816tf32spgemm/cutlass_tensorop_s16816tf32spgemm_128x64_32x3_nt_align4.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16832spgemm_f16_objs.dir/generated/gemm/80/s16832spgemm_f16/all_sm80_s16832spgemm_f16_gemm_operations.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16832spgemm_f16_objs.dir/generated/gemm/80/s16832spgemm_f16/cutlass_tensorop_s16832spgemm_f16_64x128_64x6_nn_align8.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16832spgemm_bf16_objs.dir/generated/gemm/80/s16832spgemm_bf16/cutlass_tensorop_s16832spgemm_bf16_64x128_64x6_nt_align8.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_tt_align8.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816tf32spgemm_objs.dir/generated/gemm/80/s16816tf32spgemm/cutlass_tensorop_s16816tf32spgemm_128x64_32x3_tn_align4.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16832spgemm_f16_objs.dir/generated/gemm/80/s16832spgemm_f16/cutlass_tensorop_s16832spgemm_f16_64x128_64x6_nt_align8.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_ht_align8.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816tf32spgemm_objs.dir/generated/gemm/80/s16816tf32spgemm/cutlass_tensorop_s16816tf32spgemm_128x64_32x3_tt_align4.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16832spgemm_bf16_objs.dir/generated/gemm/80/s16832spgemm_bf16/cutlass_tensorop_s16832spgemm_bf16_64x128_64x6_tn_align8.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16832spgemm_f16_objs.dir/generated/gemm/80/s16832spgemm_f16/cutlass_tensorop_s16832spgemm_f16_64x128_64x6_tn_align8.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_th_align8.cu.o [ 14%] Built target cutlass_library_gemm_sm80_s16816tf32spgemm_objs [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16832spgemm_bf16_objs.dir/generated/gemm/80/s16832spgemm_bf16/cutlass_tensorop_s16832spgemm_bf16_64x128_64x6_tt_align8.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16832spgemm_f16_objs.dir/generated/gemm/80/s16832spgemm_f16/cutlass_tensorop_s16832spgemm_f16_64x128_64x6_tt_align8.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs.dir/generated/gemm/80/s16816gemm_planar_complex_f16/cutlass_tensorop_s16816gemm_planar_complex_f16_64x128_32x3_hh_align8.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688bf16gemm_objs.dir/generated/gemm/80/s1688bf16gemm/all_sm80_s1688bf16gemm_gemm_operations.cu.o [ 14%] Built target cutlass_library_gemm_sm80_s16832spgemm_bf16_objs [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688bf16gemm_objs.dir/generated/gemm/80/s1688bf16gemm/cutlass_tensorop_s1688bf16gemm_256x128_16x3_nn_align4.cu.o [ 14%] Built target cutlass_library_gemm_sm80_s16832spgemm_f16_objs [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688bf16gemm_objs.dir/generated/gemm/80/s1688bf16gemm/cutlass_tensorop_s1688bf16gemm_256x128_16x3_nt_align4.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688f16gemm_objs.dir/generated/gemm/80/s1688f16gemm/all_sm80_s1688f16gemm_gemm_operations.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688f16gemm_objs.dir/generated/gemm/80/s1688f16gemm/cutlass_tensorop_s1688f16gemm_256x128_16x3_nn_align4.cu.o [ 14%] Built target cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_objs [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688f16gemm_objs.dir/generated/gemm/80/s1688f16gemm/cutlass_tensorop_s1688f16gemm_256x128_16x3_nt_align4.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688gemm_objs.dir/generated/gemm/80/s1688gemm/all_sm80_s1688gemm_gemm_operations.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688bf16gemm_objs.dir/generated/gemm/80/s1688bf16gemm/cutlass_tensorop_s1688bf16gemm_256x128_16x3_tn_align4.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688gemm_objs.dir/generated/gemm/80/s1688gemm/cutlass_tensorop_s1688gemm_128x128_16x4_nn_align4.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688gemm_tf32_objs.dir/generated/gemm/80/s1688gemm_tf32/all_sm80_s1688gemm_tf32_gemm_operations.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688f16gemm_objs.dir/generated/gemm/80/s1688f16gemm/cutlass_tensorop_s1688f16gemm_256x128_16x3_tn_align4.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688gemm_tf32_objs.dir/generated/gemm/80/s1688gemm_tf32/cutlass_tensorop_s1688gemm_tf32_256x128_16x3_nn_align4.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688bf16gemm_objs.dir/generated/gemm/80/s1688bf16gemm/cutlass_tensorop_s1688bf16gemm_256x128_16x3_tt_align4.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688gemm_objs.dir/generated/gemm/80/s1688gemm/cutlass_tensorop_s1688gemm_128x128_16x4_nt_align4.cu.o [ 14%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688f16gemm_objs.dir/generated/gemm/80/s1688f16gemm/cutlass_tensorop_s1688f16gemm_256x128_16x3_tt_align4.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688gemm_tf32_objs.dir/generated/gemm/80/s1688gemm_tf32/cutlass_tensorop_s1688gemm_tf32_256x128_16x3_nt_align4.cu.o [ 15%] Built target cutlass_library_gemm_sm80_s1688bf16gemm_objs [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688gemm_objs.dir/generated/gemm/80/s1688gemm/cutlass_tensorop_s1688gemm_128x128_16x4_tn_align4.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688gemm_tf32_objs.dir/generated/gemm/80/s1688gemm_tf32/cutlass_tensorop_s1688gemm_tf32_256x128_16x3_tn_align4.cu.o [ 15%] Built target cutlass_library_gemm_sm80_s1688f16gemm_objs [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688gemm_objs.dir/generated/gemm/80/s1688gemm/cutlass_tensorop_s1688gemm_128x128_16x4_tt_align4.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688tf32gemm_objs.dir/generated/gemm/80/s1688tf32gemm/all_sm80_s1688tf32gemm_gemm_operations.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688tf32gemm_objs.dir/generated/gemm/80/s1688tf32gemm/cutlass_tensorop_s1688tf32gemm_256x128_16x3_nn_align4.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688gemm_tf32_objs.dir/generated/gemm/80/s1688gemm_tf32/cutlass_tensorop_s1688gemm_tf32_256x128_16x3_tt_align4.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688tf32gemm_objs.dir/generated/gemm/80/s1688tf32gemm/cutlass_tensorop_s1688tf32gemm_256x128_16x3_nt_align4.cu.o [ 15%] Built target cutlass_library_gemm_sm80_s1688gemm_objs [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688tf32gemm_objs.dir/generated/gemm/80/s1688tf32gemm/cutlass_tensorop_s1688tf32gemm_256x128_16x3_tn_align4.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s4_i168128spgemm_s4_objs.dir/generated/gemm/80/s4_i168128spgemm_s4/all_sm80_s4_i168128spgemm_s4_gemm_operations.cu.o [ 15%] Built target cutlass_library_gemm_sm80_s1688gemm_tf32_objs [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s4_i168128spgemm_s4_objs.dir/generated/gemm/80/s4_i168128spgemm_s4/cutlass_tensorop_s4_i168128spgemm_s4_64x64_256x4_tn_align32.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s4_i16864gemm_s4_objs.dir/generated/gemm/80/s4_i16864gemm_s4/all_sm80_s4_i16864gemm_s4_gemm_operations.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s4_i16864gemm_s4_objs.dir/generated/gemm/80/s4_i16864gemm_s4/cutlass_tensorop_s4_i16864gemm_s4_256x128_128x3_tn_align32.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s1688tf32gemm_objs.dir/generated/gemm/80/s1688tf32gemm/cutlass_tensorop_s1688tf32gemm_256x128_16x3_tt_align4.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s8_i16832gemm_s4_s8_objs.dir/generated/gemm/80/s8_i16832gemm_s4_s8/all_sm80_s8_i16832gemm_s4_s8_gemm_operations.cu.o ptxas , line 3; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas , line 3; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures [ 15%] Built target cutlass_library_gemm_sm80_s4_i168128spgemm_s4_objs [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s4_i16864gemm_s4_objs.dir/generated/gemm/80/s4_i16864gemm_s4/cutlass_tensorop_s4_i16864gemm_s4_256x128_128x3_n64t64_align32.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s8_i16832gemm_s4_s8_objs.dir/generated/gemm/80/s8_i16832gemm_s4_s8/cutlass_tensorop_s8_i16832gemm_s4_s8_256x128_64x3_tn_align32.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s8_i16832gemm_s8_objs.dir/generated/gemm/80/s8_i16832gemm_s8/all_sm80_s8_i16832gemm_s8_gemm_operations.cu.o [ 15%] Built target cutlass_library_gemm_sm80_s1688tf32gemm_objs [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s8_i16832gemm_s8_objs.dir/generated/gemm/80/s8_i16832gemm_s8/cutlass_tensorop_s8_i16832gemm_s8_256x128_64x3_tn_align16.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s8_i16832gemm_s8_s4_objs.dir/generated/gemm/80/s8_i16832gemm_s8_s4/all_sm80_s8_i16832gemm_s8_s4_gemm_operations.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s8_i16832gemm_s8_s4_objs.dir/generated/gemm/80/s8_i16832gemm_s8_s4/cutlass_tensorop_s8_i16832gemm_s8_s4_256x128_64x3_tn_align32.cu.o [ 15%] Built target cutlass_library_gemm_sm80_s4_i16864gemm_s4_objs [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s8_i16832gemm_s8_objs.dir/generated/gemm/80/s8_i16832gemm_s8/cutlass_tensorop_s8_i16832gemm_s8_256x128_64x3_n32t32_align16.cu.o [ 15%] Built target cutlass_library_gemm_sm80_s8_i16832gemm_s4_s8_objs [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s8_i16864spgemm_s8_objs.dir/generated/gemm/80/s8_i16864spgemm_s8/all_sm80_s8_i16864spgemm_s8_gemm_operations.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_s8_i16864spgemm_s8_objs.dir/generated/gemm/80/s8_i16864spgemm_s8/cutlass_tensorop_s8_i16864spgemm_s8_128x64_128x3_tn_align16.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_sgemm_objs.dir/generated/gemm/80/sgemm/all_sm80_sgemm_gemm_operations.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_sgemm_objs.dir/generated/gemm/80/sgemm/cutlass_simt_sgemm_256x128_8x5_nn_align1.cu.o [ 15%] Built target cutlass_library_gemm_sm80_s8_i16832gemm_s8_objs [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_sgemm_objs.dir/generated/gemm/80/sgemm/cutlass_simt_sgemm_256x128_8x5_nt_align1.cu.o [ 15%] Built target cutlass_library_gemm_sm80_s8_i16832gemm_s8_s4_objs [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_sgemm_objs.dir/generated/gemm/80/sgemm/cutlass_simt_sgemm_256x128_8x5_tn_align1.cu.o [ 15%] Built target cutlass_library_gemm_sm80_s8_i16864spgemm_s8_objs [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_sgemm_objs.dir/generated/gemm/80/sgemm/cutlass_simt_sgemm_256x128_8x5_tt_align1.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_tf32_s1688gemm_tf32_objs.dir/generated/gemm/80/tf32_s1688gemm_tf32/all_sm80_tf32_s1688gemm_tf32_gemm_operations.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_tf32_s1688gemm_tf32_objs.dir/generated/gemm/80/tf32_s1688gemm_tf32/cutlass_tensorop_tf32_s1688gemm_tf32_256x128_16x3_nn_align4.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_u4_i16864gemm_u4_objs.dir/generated/gemm/80/u4_i16864gemm_u4/all_sm80_u4_i16864gemm_u4_gemm_operations.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_u4_i16864gemm_u4_objs.dir/generated/gemm/80/u4_i16864gemm_u4/cutlass_tensorop_u4_i16864gemm_u4_256x128_128x3_tn_align32.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_tf32_s1688gemm_tf32_objs.dir/generated/gemm/80/tf32_s1688gemm_tf32/cutlass_tensorop_tf32_s1688gemm_tf32_256x128_16x3_nt_align4.cu.o [ 15%] Built target cutlass_library_gemm_sm80_sgemm_objs [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_tf32_s1688gemm_tf32_objs.dir/generated/gemm/80/tf32_s1688gemm_tf32/cutlass_tensorop_tf32_s1688gemm_tf32_256x128_16x3_tn_align4.cu.o [ 15%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_u4_i16864gemm_u4_objs.dir/generated/gemm/80/u4_i16864gemm_u4/cutlass_tensorop_u4_i16864gemm_u4_256x128_128x3_n64t64_align32.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_u8_i16832gemm_u8_objs.dir/generated/gemm/80/u8_i16832gemm_u8/all_sm80_u8_i16832gemm_u8_gemm_operations.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_u8_i16832gemm_u8_objs.dir/generated/gemm/80/u8_i16832gemm_u8/cutlass_tensorop_u8_i16832gemm_u8_256x128_64x3_tn_align16.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_tf32_s1688gemm_tf32_objs.dir/generated/gemm/80/tf32_s1688gemm_tf32/cutlass_tensorop_tf32_s1688gemm_tf32_256x128_16x3_tt_align4.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/all_sm80_z884gemm_gemm_operations.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_nn_align1.cu.o [ 16%] Built target cutlass_library_gemm_sm80_u4_i16864gemm_u4_objs [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_cn_align1.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_u8_i16832gemm_u8_objs.dir/generated/gemm/80/u8_i16832gemm_u8/cutlass_tensorop_u8_i16832gemm_u8_256x128_64x3_n32t32_align16.cu.o [ 16%] Built target cutlass_library_gemm_sm80_tf32_s1688gemm_tf32_objs [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_nc_align1.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864fastaccumspgemm_e4m3_objs.dir/generated/gemm/89/s16864fastaccumspgemm_e4m3/all_sm89_s16864fastaccumspgemm_e4m3_gemm_operations.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864fastaccumspgemm_e4m3_e5m2_objs.dir/generated/gemm/89/s16864fastaccumspgemm_e4m3_e5m2/all_sm89_s16864fastaccumspgemm_e4m3_e5m2_gemm_operations.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864fastaccumspgemm_e4m3_objs.dir/generated/gemm/89/s16864fastaccumspgemm_e4m3/cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864fastaccumspgemm_e4m3_e5m2_objs.dir/generated/gemm/89/s16864fastaccumspgemm_e4m3_e5m2/cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.cu.o [ 16%] Built target cutlass_library_gemm_sm80_u8_i16832gemm_u8_objs [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_cc_align1.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864fastaccumspgemm_e5m2_objs.dir/generated/gemm/89/s16864fastaccumspgemm_e5m2/all_sm89_s16864fastaccumspgemm_e5m2_gemm_operations.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864fastaccumspgemm_e5m2_objs.dir/generated/gemm/89/s16864fastaccumspgemm_e5m2/cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.cu.o ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 755; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 759; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 763; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 767; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 771; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 775; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 779; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 783; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 787; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 791; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 795; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 799; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 803; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 807; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 811; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 815; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1015; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1019; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1023; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1027; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1031; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1035; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1039; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1043; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1047; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1051; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1055; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1059; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1063; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1067; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1071; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c57_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1075; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 755; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 759; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 763; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 767; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 771; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 775; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 779; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 783; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 787; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 791; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 795; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 799; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 803; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 807; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 811; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 815; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1015; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1019; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1023; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1027; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1031; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1035; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1039; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1043; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1047; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1051; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1055; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1059; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1063; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1067; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1071; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006c69_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1075; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures [ 16%] Built target cutlass_library_gemm_sm89_s16864fastaccumspgemm_e4m3_objs [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_nt_align1.cu.o [ 16%] Built target cutlass_library_gemm_sm89_s16864fastaccumspgemm_e4m3_e5m2_objs [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_ct_align1.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864fastaccumspgemm_e5m2_e4m3_objs.dir/generated/gemm/89/s16864fastaccumspgemm_e5m2_e4m3/all_sm89_s16864fastaccumspgemm_e5m2_e4m3_gemm_operations.cu.o ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 755; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 759; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 763; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 767; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 771; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 775; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 779; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 783; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 787; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 791; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 795; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 799; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 803; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 807; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 811; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 815; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1015; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1019; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1023; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1027; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1031; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1035; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1039; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1043; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1047; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1051; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1055; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1059; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1063; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1067; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1071; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006cde_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1075; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864fastaccumspgemm_e5m2_e4m3_objs.dir/generated/gemm/89/s16864fastaccumspgemm_e5m2_e4m3/cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.cu.o [ 16%] Built target cutlass_library_gemm_sm89_s16864fastaccumspgemm_e5m2_objs [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_nh_align1.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864spgemm_e4m3_objs.dir/generated/gemm/89/s16864spgemm_e4m3/all_sm89_s16864spgemm_e4m3_gemm_operations.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864spgemm_e4m3_objs.dir/generated/gemm/89/s16864spgemm_e4m3/cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.cu.o ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 755; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 759; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 763; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 767; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 771; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 775; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 779; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 783; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 787; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 791; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 795; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 799; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 803; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 807; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 811; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 815; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1015; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1019; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1023; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1027; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1031; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1035; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1039; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1043; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1047; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1051; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1055; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1059; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1063; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1067; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1071; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006d70_00000000-7_cutlass_tensorop_s16864fastaccumspgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1075; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864spgemm_e4m3_e5m2_objs.dir/generated/gemm/89/s16864spgemm_e4m3_e5m2/all_sm89_s16864spgemm_e4m3_e5m2_gemm_operations.cu.o [ 16%] Built target cutlass_library_gemm_sm89_s16864fastaccumspgemm_e5m2_e4m3_objs [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864spgemm_e4m3_e5m2_objs.dir/generated/gemm/89/s16864spgemm_e4m3_e5m2/cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_ch_align1.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864spgemm_e5m2_objs.dir/generated/gemm/89/s16864spgemm_e5m2/all_sm89_s16864spgemm_e5m2_gemm_operations.cu.o ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 755; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 759; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 763; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 767; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 771; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 775; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 779; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 783; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 787; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 791; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 795; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 799; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 803; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 807; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 811; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 815; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1015; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1019; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1023; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1027; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1031; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1035; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1039; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1043; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1047; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1051; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1055; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1059; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1063; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1067; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1071; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006dca_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1075; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures [ 16%] Built target cutlass_library_gemm_sm89_s16864spgemm_e4m3_objs [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864spgemm_e5m2_objs.dir/generated/gemm/89/s16864spgemm_e5m2/cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_tn_align1.cu.o ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 755; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 759; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 763; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 767; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 771; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 775; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 779; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 783; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 787; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 791; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 795; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 799; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 803; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 807; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 811; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 815; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1015; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1019; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1023; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1027; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1031; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1035; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1039; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1043; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1047; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1051; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1055; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1059; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1063; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1067; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1071; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e26_00000000-7_cutlass_tensorop_s16864spgemm_e4m3_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1075; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures [ 16%] Built target cutlass_library_gemm_sm89_s16864spgemm_e4m3_e5m2_objs [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_hn_align1.cu.o [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864spgemm_e5m2_e4m3_objs.dir/generated/gemm/89/s16864spgemm_e5m2_e4m3/all_sm89_s16864spgemm_e5m2_e4m3_gemm_operations.cu.o ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 755; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 759; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 763; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 767; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 771; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 775; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 779; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 783; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 787; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 791; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 795; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 799; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 803; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 807; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 811; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 815; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1015; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1019; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1023; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1027; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1031; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1035; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1039; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1043; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1047; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1051; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1055; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1059; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1063; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1067; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1071; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006e95_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_128x64_128x3_tn_align16.compute_89.ptx, line 1075; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm89_s16864spgemm_e5m2_e4m3_objs.dir/generated/gemm/89/s16864spgemm_e5m2_e4m3/cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.cu.o [ 16%] Built target cutlass_library_gemm_sm89_s16864spgemm_e5m2_objs [ 16%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_tc_align1.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/all_sm90_bf16_s64x128x16gemm_bf16_gemm_operations.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/all_sm90_bf16_s64x128x32gemm_e4m3_gemm_operations.cu.o ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 755; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 759; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 763; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 767; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 771; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 775; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 779; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 783; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 787; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 791; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 795; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 799; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 803; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 807; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 811; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 815; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1015; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1019; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1023; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1027; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1031; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1035; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1039; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1043; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1047; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1051; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1055; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1059; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1063; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1067; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1071; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures ptxas /tmp/tmpxft_00006f33_00000000-7_cutlass_tensorop_s16864spgemm_e5m2_e4m3_128x64_128x3_tn_align16.compute_89.ptx, line 1075; info : Advisory: Modifier '.sp::ordered_metadata' should be used on instruction 'mma' instead of modifier '.sp' as it is expected to have substantially reduced performance on some future architectures [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_hc_align1.cu.o [ 17%] Built target cutlass_library_gemm_sm89_s16864spgemm_e5m2_e4m3_objs [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_tt_align1.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_ht_align1.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/all_sm90_bf16_s64x128x32gemm_e4m3_e5m2_gemm_operations.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_th_align1.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm80_z884gemm_objs.dir/generated/gemm/80/z884gemm/cutlass_tensorop_z884gemm_128x64_8x3_hh_align1.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 17%] Built target cutlass_library_gemm_sm80_z884gemm_objs [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/all_sm90_bf16_s64x128x32gemm_e5m2_gemm_operations.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o [ 17%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align8.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 18%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align8.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 19%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align8.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 20%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align8.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 21%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Built target cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_objs [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Built target cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_objs [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/all_sm90_bf16_s64x128x32gemm_e5m2_e4m3_gemm_operations.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/all_sm90_bf16_s64x128x32spgemm_bf16_gemm_operations.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Built target cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_objs [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_ttn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_ttn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/all_sm90_bf16_s64x128x64spgemm_e4m3_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_ttn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_nnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_nnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_nnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_ntn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_ntn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_ntn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_tnn_align2_cpasync_warpspecialized_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_tnn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_tnn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_ttn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_ttn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_ttn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_nnn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_nnn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_nnn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_ntn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_ntn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x1x1_0_ntn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Built target cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_objs [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/all_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 22%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 23%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 23%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 23%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 23%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 23%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 23%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 23%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 23%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Built target cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_objs [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/all_sm90_bf16_s64x128x64spgemm_e5m2_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Built target cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_objs [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/all_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 24%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8.cu.o [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 25%] Built target cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_objs [ 25%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 26%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 26%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 26%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_d1684gemm_objs.dir/generated/gemm/90/d1684gemm/all_sm90_d1684gemm_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_d1684gemm_objs.dir/generated/gemm/90/d1684gemm/cutlass_sm90_tensorop_d1684gemm_f64_f64_f64_f64_f64_128x128x16_1x1x1_3_nnn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_d1684gemm_objs.dir/generated/gemm/90/d1684gemm/cutlass_sm90_tensorop_d1684gemm_f64_f64_f64_f64_f64_128x128x16_1x1x1_3_ntn_align1.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_d1684gemm_objs.dir/generated/gemm/90/d1684gemm/cutlass_sm90_tensorop_d1684gemm_f64_f64_f64_f64_f64_128x128x16_1x1x1_3_tnn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_d1684gemm_objs.dir/generated/gemm/90/d1684gemm/cutlass_sm90_tensorop_d1684gemm_f64_f64_f64_f64_f64_128x128x16_1x1x1_3_ttn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Built target cutlass_library_gemm_sm90_d1684gemm_objs [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/all_sm90_f16_s64x128x16gemm_f16_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_bf16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_bf16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_bf16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Built target cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_objs [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/all_sm90_f16_s64x128x32gemm_e4m3_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 27%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_bf16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Built target cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_objs [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/all_sm90_f16_s64x128x32gemm_e4m3_e5m2_gemm_operations.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 28%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 29%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 29%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 29%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o [ 29%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 29%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 29%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/bf16_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_bf16_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 30%] Built target cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_objs [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/all_sm90_f16_s64x128x32gemm_e5m2_gemm_operations.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_ttn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_ttn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_ttn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_nnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_nnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_nnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_ntn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_ntn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_ntn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_tnn_align2_cpasync_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_tnn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_tnn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_ttn_align2_cpasync_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_ttn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_ttn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_nnn_align2_cpasync_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_nnn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_nnn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_ntn_align2_cpasync_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_ntn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs.dir/generated/gemm/90/f16_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f16_f16_128x128x64_1x1x1_0_ntn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 30%] Built target cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_objs [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/all_sm90_f16_s64x128x32gemm_e5m2_e4m3_gemm_operations.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 30%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 31%] Built target cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_objs [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/all_sm90_f16_s64x128x32spgemm_f16_gemm_operations.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 31%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Built target cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_objs [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/all_sm90_f16_s64x128x64spgemm_e4m3_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 32%] Built target cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_objs [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/all_sm90_f16_s64x128x64spgemm_e4m3_e5m2_gemm_operations.cu.o [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 32%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 33%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 34%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 34%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 34%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 34%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 34%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 34%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 34%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Built target cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_objs [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/all_sm90_f16_s64x128x64spgemm_e5m2_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Built target cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_objs [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 35%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/all_sm90_f16_s64x128x64spgemm_e5m2_e4m3_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_f16_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Built target cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_objs [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/all_sm90_gz1684gemm_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_nnn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_cnn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu.o [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_ncn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_ccn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_ntn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_ctn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_nhn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_chn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_tnn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_hnn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 36%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_tcn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_hcn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_ttn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_htn_align1.cu.o [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_thn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_gz1684gemm_objs.dir/generated/gemm/90/gz1684gemm/cutlass_sm90_tensorop_gz1684gemm_cf64_cf64_cf64_cf64_cf64_64x64x8_1x1x1_3_hhn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Built target cutlass_library_gemm_sm90_gz1684gemm_objs [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/all_sm90_h64x128x16gemm_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f16_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Built target cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_objs [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/all_sm90_h64x128x32spgemm_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_f16_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16.cu.o [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f16_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 37%] Built target cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_objs [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/all_sm90_i64x128x32gemm_s8_gemm_operations.cu.o [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16.cu.o [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 37%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs.dir/generated/gemm/90/i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s32_s32_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 38%] Built target cutlass_library_gemm_sm90_i64x128x32gemm_s8_objs [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/all_sm90_i64x128x32gemm_u8_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/f16_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Built target cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_objs [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/all_sm90_i64x128x64spgemm_s8_gemm_operations.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 38%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs.dir/generated/gemm/90/i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s32_s32_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu.o [ 39%] Built target cutlass_library_gemm_sm90_i64x128x32gemm_u8_objs [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/all_sm90_i64x128x64spgemm_u8_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_ttn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_ttn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu.o [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_ttn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_nnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_nnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_nnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_ntn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_ntn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_ntn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_tnn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_tnn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_tnn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_ttn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Built target cutlass_library_gemm_sm90_i64x128x64spgemm_s8_objs [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_ttn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_ttn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/all_sm90_s64x128x16gemm_bf16_gemm_operations.cu.o [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_nnn_align2_cpasync_warpspecialized_epi_nosmem.cu.o [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_nnn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_nnn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32.cu.o [ 39%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_ntn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_ntn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu.o [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x16gemm_objs.dir/generated/gemm/90/h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_f16_f16_128x128x64_1x1x1_0_ntn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 40%] Built target cutlass_library_gemm_sm90_h64x128x16gemm_objs [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/all_sm90_s64x128x16gemm_f16_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 40%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s32_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Built target cutlass_library_gemm_sm90_i64x128x64spgemm_u8_objs [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/all_sm90_s64x128x16spgemm_tf32_gemm_operations.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 41%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8.cu.o [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 42%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 43%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_ttn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_ttn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_ttn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_nnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_nnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_nnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_ntn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_h64x128x32spgemm_objs.dir/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_ntn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_ntn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_ttn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_tnn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_f16_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_ttn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 44%] Built target cutlass_library_gemm_sm90_h64x128x32spgemm_objs [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_tnn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_ttn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_tnn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_nnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_ttn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_nnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/all_sm90_s64x128x16tf32spgemm_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_ttn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_nnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_ttn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_ntn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_nnn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_ntn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_nnn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_ntn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_nnn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_tnn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_ntn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_tnn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_ntn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_tnn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_f32_f32_128x128x64_1x1x1_0_ntn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_ttn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Built target cutlass_library_gemm_sm90_s64x128x16gemm_bf16_objs [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_ttn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_ttn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_nnn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_nnn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/all_sm90_s64x128x32gemm_e4m3_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_nnn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_ntn_align2_cpasync_warpspecialized_epi_nosmem.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_ntn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 44%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs.dir/generated/gemm/90/s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_f32_f32_128x128x64_1x1x1_0_ntn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Built target cutlass_library_gemm_sm90_s64x128x16gemm_f16_objs [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/all_sm90_s64x128x32gemm_e4m3_e5m2_gemm_operations.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs.dir/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::tfloat32_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::tfloat32_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16spgemm_tf32/cutlass3x_sm90_sptensorop_s64x128x16spgemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Built target cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_objs [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/all_sm90_s64x128x32gemm_e5m2_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 45%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 46%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 46%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 46%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 46%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 47%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 48%] Built target cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_objs [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 48%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs.dir/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/all_sm90_s64x128x32gemm_e5m2_e4m3_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(247): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk1to2::storage() const [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=float, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, float, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<4, uint8_t>, cute::C<32>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x16tf32spgemm/cutlass3x_sm90_sptensorop_s64x128x16tf32spgemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 49%] Built target cutlass_library_gemm_sm90_s64x128x16tf32spgemm_objs [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/all_sm90_s64x128x32spgemm_bf16_gemm_operations.cu.o [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu.o [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 49%] Built target cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_objs [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/all_sm90_s64x128x32spgemm_f16_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu.o [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Built target cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_objs [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/all_sm90_s64x128x64spgemm_e4m3_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 49%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Built target cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_objs [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Built target cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_objs [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/all_sm90_s64x128x64spgemm_e4m3_e5m2_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/all_sm90_s64x128x64spgemm_e5m2_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 50%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 51%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 52%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Built target cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_objs [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/all_sm90_s64x128x64spgemm_e5m2_e4m3_gemm_operations.cu.o [ 53%] Built target cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_objs [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/all_sm90_s64x128x8gemm_tf32_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 53%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align4_warpspecialized_pingpong_epi_tma.cu.o [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_ttn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_ttn_align4_warpspecialized.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_ttn_align4_warpspecialized_pingpong.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_nnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_nnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_nnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_nnn_align4_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_ntn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_ntn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_ntn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_2x1x1_0_ntn_align4_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Built target cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_objs [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align4_warpspecialized_epi_nosmem.cu.o [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/all_sm90_s64x128x8tf32gemm_gemm_operations.cu.o [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_f32_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Built target cutlass_library_gemm_sm90_s64x128x32spgemm_f16_objs [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align4_warpspecialized_epi_nosmem.cu.o [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 54%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_ttn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_tnn_align4_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/all_sm90_s8_i64x128x32gemm_s8_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_ttn_align4_warpspecialized.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_ttn_align4.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_ttn_align4_warpspecialized_pingpong.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_ttn_align4_warpspecialized.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_ttn_align4_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_ttn_align4_warpspecialized_pingpong.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_ttn_align4_stream_k_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_nnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_nnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_nnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_nnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_nnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_nnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_nnn_align4_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_nnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_ntn_align4.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_nnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_ntn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_ntn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_ntn_align4_warpspecialized_pingpong_epi_nosmem.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_ntn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_2x1x1_0_ntn_align4_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_ntn_align4_warpspecialized_pingpong_epi_nosmem.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align4.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_ntn_align4_warpspecialized_cooperative_epi_nosmem.cu.o [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 55%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_1x2x1_0_ntn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_tnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_ttn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_ttn_align4_warpspecialized.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_s8_s8_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Built target cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_objs [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_ttn_align4_warpspecialized_pingpong.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_ttn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_ttn_align4_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/all_sm90_s8_i64x128x32gemm_u8_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align16.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_ttn_align4_warpspecialized.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_ttn_align4_stream_k_warpspecialized_cooperative.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_ttn_align4_warpspecialized_pingpong.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_nnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_ttn_align4_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_f32_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_nnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_ttn_align4_stream_k_warpspecialized_cooperative.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_nnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_nnn_align4.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_nnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_nnn_align4_warpspecialized_epi_nosmem.cu.o [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 56%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_nnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_nnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_ntn_align4.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_nnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_ntn_align4_warpspecialized_epi_nosmem.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align16_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e4m3_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_nnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_ntn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_ntn_align4.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_ntn_align4_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_ntn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_1x2x1_0_ntn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_ntn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align4.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_ntn_align4_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_256x128x32_2x1x1_0_ntn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_f32_e5m2_64x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Built target cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_objs [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_epi_nosmem.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_1x1x1_0_tnn_align8_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/all_sm90_s8_i64x128x64spgemm_s8_gemm_operations.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4_warpspecialized_epi_nosmem.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_tnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_1x1x1_0_tnn_align8_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_epi_nosmem.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_ttn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu.o [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_1x1x1_0_tnn_align4_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 57%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_ttn_align4_warpspecialized.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4_warpspecialized_pingpong_epi_tma.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs.dir/generated/gemm/90/s8_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_s8_u8_128x128x128_1x1x1_0_tnn_align4_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Built target cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_objs [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_ttn_align4_warpspecialized_pingpong.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4_warpspecialized_cooperative_epi_tma.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_ttn_align4_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/all_sm90_s8_i64x128x64spgemm_u8_gemm_operations.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_ttn_align4_stream_k_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_nnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32.cu /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_ttn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_nnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_ttn_align4_warpspecialized.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_nnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_ttn_align4_warpspecialized_pingpong.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_nnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_ttn_align4_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_nnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_ttn_align4_stream_k_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_ntn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_ntn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_ntn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_ntn_align4_warpspecialized_cooperative_epi_nosmem.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_256x128x32_2x1x1_0_ntn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_tnn_align4_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_ttn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4_warpspecialized_pingpong_epi_tma.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_ttn_align4_warpspecialized.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_ttn_align4_warpspecialized_pingpong.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_ttn_align4_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_ttn_align4_stream_k_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align4_warpspecialized_epi_nosmem.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=int8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, int8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_s8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_s8_s8_s32_s8_s8_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4_warpspecialized_epi_nosmem.cu.o [ 58%] Built target cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_objs [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align4_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_ttn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/all_sm90_void_h64x128x16gemm_gemm_operations.cu.o [ 58%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_ttn_align4_warpspecialized.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 59%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 59%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 59%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_ttn_align4_warpspecialized_pingpong.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/s8_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_s8_u8_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 59%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 59%] Built target cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_objs [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4_warpspecialized_cooperative_epi_tma.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_nnn_align4.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/all_sm90_void_h64x128x32spgemm_gemm_operations.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_nnn_align4_warpspecialized_epi_nosmem.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_nnn_align4_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_nnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_nnn_align4_warpspecialized_pingpong_epi_tma.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4_warpspecialized_epi_nosmem.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_ntn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4_warpspecialized_cooperative_epi_nosmem.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_ntn_align4_warpspecialized_epi_nosmem.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4_warpspecialized_pingpong_epi_tma.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_ntn_align4_warpspecialized_pingpong_epi_nosmem.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_64x128x32_1x2x1_0_ntn_align4_warpspecialized_pingpong_epi_tma.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_2x1x1_0_ntn_align4_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align4.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4_warpspecialized_cooperative_epi_tma.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_tnn_align4_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_ttn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 60%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_ttn_align4_warpspecialized.cu.o [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_ttn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_ttn_align4_warpspecialized_pingpong.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_ttn_align4_warpspecialized.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_nnn_align4.cu.o [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_ttn_align4_warpspecialized_pingpong.cu.o [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_nnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_ttn_align4_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_nnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_ttn_align4_stream_k_warpspecialized_cooperative.cu.o [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_nnn_align4_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_ntn_align4.cu.o [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_ntn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 61%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_ntn_align4_warpspecialized_pingpong_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_64x128x32_1x2x1_0_ntn_align4_warpspecialized_pingpong_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4_warpspecialized_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4_warpspecialized_pingpong_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4_warpspecialized_cooperative_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4_warpspecialized_pingpong_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_tnn_align4_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4_warpspecialized_pingpong_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_ttn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4_warpspecialized_cooperative_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_ttn_align4_warpspecialized.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_ttn_align4_warpspecialized_pingpong.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_ttn_align4_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_tnn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_ttn_align4_stream_k_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_tnn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_tnn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_ttn_align2_cpasync_warpspecialized.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_ttn_align2_cpasync_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4_warpspecialized_pingpong_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_ttn_align2_stream_k_cpasync_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_nnn_align2_cpasync_warpspecialized_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4_warpspecialized_pingpong_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_nnn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_nnn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_ntn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_ntn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x16gemm_objs.dir/generated/gemm/90/void_h64x128x16gemm/cutlass3x_sm90_tensorop_h64x128x16gemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_ntn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_nnn_align4_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Built target cutlass_library_gemm_sm90_void_h64x128x16gemm_objs [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_tnn_align1_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_tnn_align1_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_tnn_align1_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_s8_objs.dir/generated/gemm/90/void_i64x128x32gemm_s8/all_sm90_void_i64x128x32gemm_s8_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_s8_objs.dir/generated/gemm/90/void_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_ttn_align1_cpasync_warpspecialized.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_s8_objs.dir/generated/gemm/90/void_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4_warpspecialized_pingpong_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_ttn_align1_cpasync_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_s8_objs.dir/generated/gemm/90/void_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_ttn_align1_stream_k_cpasync_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_s8_objs.dir/generated/gemm/90/void_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_nnn_align1_cpasync_warpspecialized_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_nnn_align1_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_s8_objs.dir/generated/gemm/90/void_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_nnn_align1_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_s8_objs.dir/generated/gemm/90/void_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_void_s32_128x128x128_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_ntn_align1_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_s8_objs.dir/generated/gemm/90/void_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_ntn_align1_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x2x1_0_ntn_align4_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_s8_objs.dir/generated/gemm/90/void_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs.dir/generated/gemm/90/s64x128x8gemm_tf32/cutlass3x_sm90_tensorop_s64x128x8gemm_tf32_tf32_f32_f32_f32_128x128x32_1x1x1_0_ntn_align1_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_tnn_align2_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Built target cutlass_library_gemm_sm90_s64x128x8gemm_tf32_objs [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_s8_objs.dir/generated/gemm/90/void_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_tnn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 62%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_u8_objs.dir/generated/gemm/90/void_i64x128x32gemm_u8/all_sm90_void_i64x128x32gemm_u8_gemm_operations.cu.o [ 63%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_tnn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 63%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_u8_objs.dir/generated/gemm/90/void_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 63%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_s8_objs.dir/generated/gemm/90/void_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 63%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_ttn_align2_cpasync_warpspecialized.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 63%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_u8_objs.dir/generated/gemm/90/void_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 63%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_s8_objs.dir/generated/gemm/90/void_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 63%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_ttn_align2_cpasync_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 63%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_u8_objs.dir/generated/gemm/90/void_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o [ 63%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_s8_objs.dir/generated/gemm/90/void_i64x128x32gemm_s8/cutlass3x_sm90_tensorop_i64x128x32gemm_s8_s8_s32_void_s32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_ttn_align2_stream_k_cpasync_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_u8_objs.dir/generated/gemm/90/void_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_nnn_align2_cpasync_warpspecialized_epi_nosmem.cu.o [ 64%] Built target cutlass_library_gemm_sm90_void_i64x128x32gemm_s8_objs [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_u8_objs.dir/generated/gemm/90/void_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_nnn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_u8_objs.dir/generated/gemm/90/void_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_nnn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_u8_objs.dir/generated/gemm/90/void_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_ntn_align2_cpasync_warpspecialized_epi_nosmem.cu.o [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_ntn_align2_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_u8_objs.dir/generated/gemm/90/void_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_u8/all_sm90_void_i64x128x64spgemm_u8_gemm_operations.cu.o [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_ntn_align2_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_u8_objs.dir/generated/gemm/90/void_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_tnn_align1_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_tnn_align1_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_u8_objs.dir/generated/gemm/90/void_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu.o [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_tnn_align1_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_u8_objs.dir/generated/gemm/90/void_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_ttn_align1_cpasync_warpspecialized.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x32gemm_u8_objs.dir/generated/gemm/90/void_i64x128x32gemm_u8/cutlass3x_sm90_tensorop_i64x128x32gemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_ttn_align1_cpasync_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_ttn_align1_stream_k_cpasync_warpspecialized_cooperative.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Built target cutlass_library_gemm_sm90_void_i64x128x32gemm_u8_objs [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_nnn_align1_cpasync_warpspecialized_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/all_sm90_void_s64x128x16gemm_bf16_gemm_operations.cu.o [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_nnn_align1_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_nnn_align1_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_ntn_align1_cpasync_warpspecialized_epi_nosmem.cu.o [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_ntn_align1_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs.dir/generated/gemm/90/s64x128x8tf32gemm/cutlass3x_sm90_tensorop_s64x128x8tf32gemm_f32_f32_f32_f32_f32_128x128x32_1x1x1_0_ntn_align1_stream_k_cpasync_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 64%] Built target cutlass_library_gemm_sm90_s64x128x8tf32gemm_objs [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_2x1x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu.o [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu.o [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/all_sm90_void_s64x128x16gemm_f16_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 64%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu.o [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8_objs.dir/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=uint8_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, uint8_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_i64x128x64spgemm_u8/cutlass3x_sm90_sptensorop_i64x128x64spgemm_u8_u8_s32_void_s32_128x128x128_1x2x1_0_tnn_align32_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Built target cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8_objs [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 65%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688tf32syrk_objs.dir/generated/rank_k/80/c1688tf32syrk/all_sm80_c1688tf32syrk_rank_k_operations.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688tf32syrk_objs.dir/generated/rank_k/80/c1688tf32syrk/cutlass_tensorop_c1688tf32syrk_128x64_16x4_n_l_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688tf32syrk_objs.dir/generated/rank_k/80/c1688tf32syrk/cutlass_tensorop_c1688tf32syrk_128x64_16x4_n_u_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688tf32syrk_objs.dir/generated/rank_k/80/c1688tf32syrk/cutlass_tensorop_c1688tf32syrk_128x64_16x4_t_l_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688tf32syrk_objs.dir/generated/rank_k/80/c1688tf32syrk/cutlass_tensorop_c1688tf32syrk_128x64_16x4_t_u_align1.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Built target cutlass_library_rank_k_sm80_c1688tf32syrk_objs [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/void_s64x128x32gemm_e4m3/all_sm90_void_s64x128x32gemm_e4m3_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/void_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/void_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/void_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3_objs.dir/generated/gemm/90/void_s64x128x32gemm_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Built target cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3_objs [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/void_s64x128x32gemm_e4m3_e5m2/all_sm90_void_s64x128x32gemm_e4m3_e5m2_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/void_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/void_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 66%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/void_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3_e5m2_objs.dir/generated/gemm/90/void_s64x128x32gemm_e4m3_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e4m3_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Built target cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3_e5m2_objs [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs.dir/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/void_s64x128x32gemm_e5m2/all_sm90_void_s64x128x32gemm_e5m2_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/void_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_h64x128x32spgemm/cutlass3x_sm90_sptensorop_h64x128x32spgemm_f16_f16_f16_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/void_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 67%] Built target cutlass_library_gemm_sm90_void_h64x128x32spgemm_objs [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/void_s64x128x32gemm_e5m2_e4m3/all_sm90_void_s64x128x32gemm_e5m2_e4m3_gemm_operations.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/void_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/void_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/void_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2_objs.dir/generated/gemm/90/void_s64x128x32gemm_e5m2/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/void_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 67%] Built target cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2_objs [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2_e4m3_objs.dir/generated/gemm/90/void_s64x128x32gemm_e5m2_e4m3/cutlass3x_sm90_tensorop_s64x128x32gemm_e5m2_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align16_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/all_sm90_void_s64x128x32spgemm_bf16_gemm_operations.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 67%] Built target cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2_e4m3_objs [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/all_sm90_void_s64x128x32spgemm_f16_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 67%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 68%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 69%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs.dir/generated/gemm/90/void_s64x128x16gemm_bf16/cutlass3x_sm90_tensorop_s64x128x16gemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 70%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_z884herk_objs.dir/generated/rank_k/80/z884herk/all_sm80_z884herk_rank_k_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_z884herk_objs.dir/generated/rank_k/80/z884herk/cutlass_tensorop_z884herk_128x64_8x3_n_l_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_z884herk_objs.dir/generated/rank_k/80/z884herk/cutlass_tensorop_z884herk_128x64_8x3_n_u_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_z884herk_objs.dir/generated/rank_k/80/z884herk/cutlass_tensorop_z884herk_128x64_8x3_h_l_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_z884herk_objs.dir/generated/rank_k/80/z884herk/cutlass_tensorop_z884herk_128x64_8x3_h_u_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_rank_k_sm80_z884herk_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs.dir/generated/gemm/90/void_s64x128x16gemm_f16/cutlass3x_sm90_tensorop_s64x128x16gemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/void_s64x128x64spgemm_e4m3/all_sm90_void_s64x128x64spgemm_e4m3_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/void_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x64spgemm_e4m3_objs.dir/generated/gemm/90/void_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_gemm_sm90_void_s64x128x64spgemm_e4m3_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/void_s64x128x64spgemm_e4m3_e5m2/all_sm90_void_s64x128x64spgemm_e4m3_e5m2_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/void_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/void_s64x128x64spgemm_e5m2/all_sm90_void_s64x128x64spgemm_e5m2_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/void_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x64spgemm_e4m3_e5m2_objs.dir/generated/gemm/90/void_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x64spgemm_e5m2_objs.dir/generated/gemm/90/void_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e4m3_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e4m3_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e4m3_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e4m3_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e5m2_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_gemm_sm90_void_s64x128x64spgemm_e4m3_e5m2_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_gemm_sm90_void_s64x128x64spgemm_e5m2_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/void_s64x128x64spgemm_e5m2_e4m3/all_sm90_void_s64x128x64spgemm_e5m2_e4m3_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/void_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/all_sm90_z1684gemm_gemm_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_nnn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_cnn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_ncn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_void_e4m3_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x64spgemm_e5m2_e4m3_objs.dir/generated/gemm/90/void_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_ccn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_ntn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_ctn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::float_e5m2_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::float_e5m2_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<128>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x64spgemm_e5m2_e4m3/cutlass3x_sm90_sptensorop_s64x128x64spgemm_e5m2_e4m3_f32_void_e5m2_256x128x128_1x2x1_0_tnn_align32_warpspecialized_cooperative_fp8_fastaccum_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_gemm_sm90_void_s64x128x64spgemm_e5m2_e4m3_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_nhn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_chn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_tnn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_hnn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_tcn_align1.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_hcn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_ttn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_htn_align1.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm50_cf32_cdgrad_optimized_cf32_objs.dir/generated/conv2d/50/cf32_cdgrad_optimized_cf32/all_sm50_cf32_cdgrad_optimized_cf32_conv2d_operations.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_thn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm50_cf32_cdgrad_optimized_cf32_objs.dir/generated/conv2d/50/cf32_cdgrad_optimized_cf32/cutlass_simt_cf32_cdgrad_optimized_cf32_128x64_8x2_nhwc_unity_stride_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_z1684gemm_objs.dir/generated/gemm/90/z1684gemm/cutlass_sm90_tensorop_z1684gemm_cf64_cf64_cf64_cf64_cf64_128x64x8_1x1x1_3_hhn_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm50_cf32_cdgrad_optimized_cf32_objs.dir/generated/conv2d/50/cf32_cdgrad_optimized_cf32/cutlass_simt_cf32_cdgrad_optimized_cf32_128x64_8x2_nhwc_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_gemm_sm90_z1684gemm_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o [ 71%] Built target cutlass_library_conv2d_sm50_cf32_cdgrad_optimized_cf32_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm50_cf32_cfprop_optimized_cf32_objs.dir/generated/conv2d/50/cf32_cfprop_optimized_cf32/all_sm50_cf32_cfprop_optimized_cf32_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm50_cf32_cfprop_optimized_cf32_objs.dir/generated/conv2d/50/cf32_cfprop_optimized_cf32/cutlass_simt_cf32_cfprop_optimized_cf32_128x64_8x2_nhwc_align1.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm50_cf32_cwgrad_optimized_cf32_objs.dir/generated/conv2d/50/cf32_cwgrad_optimized_cf32/all_sm50_cf32_cwgrad_optimized_cf32_conv2d_operations.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm50_cf32_cwgrad_optimized_cf32_objs.dir/generated/conv2d/50/cf32_cwgrad_optimized_cf32/cutlass_simt_cf32_cwgrad_optimized_cf32_128x64_8x2_nhwc_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_conv2d_sm50_cf32_cfprop_optimized_cf32_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_conv2d_sm50_cf32_cwgrad_optimized_cf32_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm50_sdgrad_optimized_objs.dir/generated/conv2d/50/sdgrad_optimized/all_sm50_sdgrad_optimized_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm50_sdgrad_optimized_objs.dir/generated/conv2d/50/sdgrad_optimized/cutlass_simt_sdgrad_optimized_128x128_8x2_nhwc_unity_stride_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm50_sdgrad_optimized_objs.dir/generated/conv2d/50/sdgrad_optimized/cutlass_simt_sdgrad_optimized_128x128_8x2_nhwc_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm50_sfprop_optimized_objs.dir/generated/conv2d/50/sfprop_optimized/all_sm50_sfprop_optimized_conv2d_operations.cu.o [ 71%] Built target cutlass_library_conv2d_sm50_sdgrad_optimized_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm50_sfprop_optimized_objs.dir/generated/conv2d/50/sfprop_optimized/cutlass_simt_sfprop_optimized_128x128_8x2_nhwc_align1.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_conv2d_sm50_sfprop_optimized_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm50_swgrad_optimized_objs.dir/generated/conv2d/50/swgrad_optimized/all_sm50_swgrad_optimized_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm50_swgrad_optimized_objs.dir/generated/conv2d/50/swgrad_optimized/cutlass_simt_swgrad_optimized_128x128_8x2_nhwc_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm60_hfprop_optimized_objs.dir/generated/conv2d/60/hfprop_optimized/all_sm60_hfprop_optimized_conv2d_operations.cu.o [ 71%] Built target cutlass_library_conv2d_sm50_swgrad_optimized_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm60_hfprop_optimized_objs.dir/generated/conv2d/60/hfprop_optimized/cutlass_simt_hfprop_optimized_64x32x9_1x8x8x32_3_filter3x3_nhwc_depthwise_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_conv2d_sm60_hfprop_optimized_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_f16_s884dgrad_optimized_f16_objs.dir/generated/conv2d/70/f16_s884dgrad_optimized_f16/all_sm70_f16_s884dgrad_optimized_f16_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_f16_s884dgrad_optimized_f16_objs.dir/generated/conv2d/70/f16_s884dgrad_optimized_f16/cutlass_tensorop_f16_s884dgrad_optimized_f16_256x128_32x2_nhwc_unity_stride_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_f16_s884fprop_optimized_f16_objs.dir/generated/conv2d/70/f16_s884fprop_optimized_f16/all_sm70_f16_s884fprop_optimized_f16_conv2d_operations.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_f16_s884dgrad_optimized_f16_objs.dir/generated/conv2d/70/f16_s884dgrad_optimized_f16/cutlass_tensorop_f16_s884dgrad_optimized_f16_256x128_32x2_nhwc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_f16_s884fprop_optimized_f16_objs.dir/generated/conv2d/70/f16_s884fprop_optimized_f16/cutlass_tensorop_f16_s884fprop_optimized_f16_256x128_32x2_nhwc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_conv2d_sm70_f16_s884dgrad_optimized_f16_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 71%] Built target cutlass_library_conv2d_sm70_f16_s884fprop_optimized_f16_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_1x2x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_f16_s884wgrad_optimized_f16_objs.dir/generated/conv2d/70/f16_s884wgrad_optimized_f16/all_sm70_f16_s884wgrad_optimized_f16_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_f16_s884wgrad_optimized_f16_objs.dir/generated/conv2d/70/f16_s884wgrad_optimized_f16/cutlass_tensorop_f16_s884wgrad_optimized_f16_256x128_32x2_nhwc_align8.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_h884dgrad_optimized_objs.dir/generated/conv2d/70/h884dgrad_optimized/all_sm70_h884dgrad_optimized_conv2d_operations.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_h884dgrad_optimized_objs.dir/generated/conv2d/70/h884dgrad_optimized/cutlass_tensorop_h884dgrad_optimized_256x128_32x2_nhwc_unity_stride_align8.cu.o [ 71%] Built target cutlass_library_conv2d_sm70_f16_s884wgrad_optimized_f16_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_h884dgrad_optimized_objs.dir/generated/conv2d/70/h884dgrad_optimized/cutlass_tensorop_h884dgrad_optimized_256x128_32x2_nhwc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_h884fprop_optimized_objs.dir/generated/conv2d/70/h884fprop_optimized/all_sm70_h884fprop_optimized_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_conv2d_sm70_h884dgrad_optimized_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_h884fprop_optimized_objs.dir/generated/conv2d/70/h884fprop_optimized/cutlass_tensorop_h884fprop_optimized_256x128_32x2_nhwc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_conv2d_sm70_h884fprop_optimized_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_h884wgrad_optimized_objs.dir/generated/conv2d/70/h884wgrad_optimized/all_sm70_h884wgrad_optimized_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_h884wgrad_optimized_objs.dir/generated/conv2d/70/h884wgrad_optimized/cutlass_tensorop_h884wgrad_optimized_256x128_32x2_nhwc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_conv2d_sm70_h884wgrad_optimized_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_s884dgrad_optimized_f16_objs.dir/generated/conv2d/70/s884dgrad_optimized_f16/all_sm70_s884dgrad_optimized_f16_conv2d_operations.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_s884dgrad_optimized_f16_objs.dir/generated/conv2d/70/s884dgrad_optimized_f16/cutlass_tensorop_s884dgrad_optimized_f16_256x128_32x2_nhwc_unity_stride_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_s884dgrad_optimized_f16_objs.dir/generated/conv2d/70/s884dgrad_optimized_f16/cutlass_tensorop_s884dgrad_optimized_f16_256x128_32x2_nhwc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_conv2d_sm70_s884dgrad_optimized_f16_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_s884fprop_optimized_f16_objs.dir/generated/conv2d/70/s884fprop_optimized_f16/all_sm70_s884fprop_optimized_f16_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_s884fprop_optimized_f16_objs.dir/generated/conv2d/70/s884fprop_optimized_f16/cutlass_tensorop_s884fprop_optimized_f16_256x128_32x2_nhwc_align8.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_s884wgrad_optimized_f16_objs.dir/generated/conv2d/70/s884wgrad_optimized_f16/all_sm70_s884wgrad_optimized_f16_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm70_s884wgrad_optimized_f16_objs.dir/generated/conv2d/70/s884wgrad_optimized_f16/cutlass_tensorop_s884wgrad_optimized_f16_256x128_32x2_nhwc_align8.cu.o [ 71%] Built target cutlass_library_conv2d_sm70_s884fprop_optimized_f16_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_conv2d_sm70_s884wgrad_optimized_f16_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_cf32_cdgrad_optimized_cf32_objs.dir/generated/conv2d/75/cf32_cdgrad_optimized_cf32/all_sm75_cf32_cdgrad_optimized_cf32_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_cf32_cdgrad_optimized_cf32_objs.dir/generated/conv2d/75/cf32_cdgrad_optimized_cf32/cutlass_simt_cf32_cdgrad_optimized_cf32_128x128_8x5_nhwc_unity_stride_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_cf32_cfprop_optimized_cf32_objs.dir/generated/conv2d/75/cf32_cfprop_optimized_cf32/all_sm75_cf32_cfprop_optimized_cf32_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_cf32_cfprop_optimized_cf32_objs.dir/generated/conv2d/75/cf32_cfprop_optimized_cf32/cutlass_simt_cf32_cfprop_optimized_cf32_128x128_8x5_nhwc_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_cf32_cdgrad_optimized_cf32_objs.dir/generated/conv2d/75/cf32_cdgrad_optimized_cf32/cutlass_simt_cf32_cdgrad_optimized_cf32_128x128_8x5_nhwc_align1.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_conv2d_sm75_cf32_cfprop_optimized_cf32_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_conv2d_sm75_cf32_cdgrad_optimized_cf32_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_cf32_cwgrad_optimized_cf32_objs.dir/generated/conv2d/75/cf32_cwgrad_optimized_cf32/all_sm75_cf32_cwgrad_optimized_cf32_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_cf32_cwgrad_optimized_cf32_objs.dir/generated/conv2d/75/cf32_cwgrad_optimized_cf32/cutlass_simt_cf32_cwgrad_optimized_cf32_128x128_8x5_nhwc_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_conv2d_sm75_cf32_cwgrad_optimized_cf32_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_f16_s1688dgrad_optimized_f16_objs.dir/generated/conv2d/75/f16_s1688dgrad_optimized_f16/all_sm75_f16_s1688dgrad_optimized_f16_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_nnn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_f16_s1688dgrad_optimized_f16_objs.dir/generated/conv2d/75/f16_s1688dgrad_optimized_f16/cutlass_tensorop_f16_s1688dgrad_optimized_f16_256x128_32x2_nhwc_unity_stride_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_f16_s1688dgrad_optimized_f16_objs.dir/generated/conv2d/75/f16_s1688dgrad_optimized_f16/cutlass_tensorop_f16_s1688dgrad_optimized_f16_256x128_32x2_nhwc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Built target cutlass_library_conv2d_sm75_f16_s1688dgrad_optimized_f16_objs [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 71%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_f16_s1688fprop_few_channels_f16_objs.dir/generated/conv2d/75/f16_s1688fprop_few_channels_f16/all_sm75_f16_s1688fprop_few_channels_f16_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_f16_s1688fprop_few_channels_f16_objs.dir/generated/conv2d/75/f16_s1688fprop_few_channels_f16/cutlass_tensorop_f16_s1688fprop_few_channels_f16_128x64_32x2_nhwc_align1.cu.o [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_f16_s1688fprop_fixed_channels_f16_objs.dir/generated/conv2d/75/f16_s1688fprop_fixed_channels_f16/all_sm75_f16_s1688fprop_fixed_channels_f16_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_f16_s1688fprop_fixed_channels_f16_objs.dir/generated/conv2d/75/f16_s1688fprop_fixed_channels_f16/cutlass_tensorop_f16_s1688fprop_fixed_channels_f16_128x64_32x2_nhwc_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 72%] Built target cutlass_library_conv2d_sm75_f16_s1688fprop_few_channels_f16_objs [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o [ 72%] Built target cutlass_library_conv2d_sm75_f16_s1688fprop_fixed_channels_f16_objs [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu.o [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_f16_s1688fprop_optimized_f16_objs.dir/generated/conv2d/75/f16_s1688fprop_optimized_f16/all_sm75_f16_s1688fprop_optimized_f16_conv2d_operations.cu.o [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_f16_s1688fprop_optimized_f16_objs.dir/generated/conv2d/75/f16_s1688fprop_optimized_f16/cutlass_tensorop_f16_s1688fprop_optimized_f16_256x128_32x2_nhwc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Built target cutlass_library_conv2d_sm75_f16_s1688fprop_optimized_f16_objs [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_f16_s1688wgrad_optimized_f16_objs.dir/generated/conv2d/75/f16_s1688wgrad_optimized_f16/all_sm75_f16_s1688wgrad_optimized_f16_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_f16_s1688wgrad_optimized_f16_objs.dir/generated/conv2d/75/f16_s1688wgrad_optimized_f16/cutlass_tensorop_f16_s1688wgrad_optimized_f16_256x128_32x2_nhwc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_h1688dgrad_optimized_objs.dir/generated/conv2d/75/h1688dgrad_optimized/all_sm75_h1688dgrad_optimized_conv2d_operations.cu.o [ 72%] Built target cutlass_library_conv2d_sm75_f16_s1688wgrad_optimized_f16_objs [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_h1688dgrad_optimized_objs.dir/generated/conv2d/75/h1688dgrad_optimized/cutlass_tensorop_h1688dgrad_optimized_256x128_32x2_nhwc_unity_stride_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_h1688dgrad_optimized_objs.dir/generated/conv2d/75/h1688dgrad_optimized/cutlass_tensorop_h1688dgrad_optimized_256x128_32x2_nhwc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 72%] Built target cutlass_library_conv2d_sm75_h1688dgrad_optimized_objs [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_h1688fprop_few_channels_objs.dir/generated/conv2d/75/h1688fprop_few_channels/all_sm75_h1688fprop_few_channels_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_h1688fprop_few_channels_objs.dir/generated/conv2d/75/h1688fprop_few_channels/cutlass_tensorop_h1688fprop_few_channels_128x64_32x2_nhwc_align1.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Built target cutlass_library_conv2d_sm75_h1688fprop_few_channels_objs [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::ColumnMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::MN, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ntn_align8_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_h1688fprop_fixed_channels_objs.dir/generated/conv2d/75/h1688fprop_fixed_channels/all_sm75_h1688fprop_fixed_channels_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_h1688fprop_fixed_channels_objs.dir/generated/conv2d/75/h1688fprop_fixed_channels/cutlass_tensorop_h1688fprop_fixed_channels_128x64_32x2_nhwc_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Built target cutlass_library_conv2d_sm75_h1688fprop_fixed_channels_objs [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_h1688fprop_optimized_objs.dir/generated/conv2d/75/h1688fprop_optimized/all_sm75_h1688fprop_optimized_conv2d_operations.cu.o [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_h1688fprop_optimized_objs.dir/generated/conv2d/75/h1688fprop_optimized/cutlass_tensorop_h1688fprop_optimized_256x128_32x2_nhwc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_h1688wgrad_optimized_objs.dir/generated/conv2d/75/h1688wgrad_optimized/all_sm75_h1688wgrad_optimized_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 72%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_h1688wgrad_optimized_objs.dir/generated/conv2d/75/h1688wgrad_optimized/cutlass_tensorop_h1688wgrad_optimized_256x128_32x2_nhwc_align8.cu.o [ 72%] Built target cutlass_library_conv2d_sm75_h1688fprop_optimized_objs [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Built target cutlass_library_conv2d_sm75_h1688wgrad_optimized_objs [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_i8816fprop_optimized_s8_objs.dir/generated/conv2d/75/i8816fprop_optimized_s8/all_sm75_i8816fprop_optimized_s8_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_i8816fprop_optimized_s8_objs.dir/generated/conv2d/75/i8816fprop_optimized_s8/cutlass_tensorop_i8816fprop_optimized_s8_256x128_64x2_nhwc_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Built target cutlass_library_conv2d_sm75_i8816fprop_optimized_s8_objs [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_i8816fprop_optimized_u8_objs.dir/generated/conv2d/75/i8816fprop_optimized_u8/all_sm75_i8816fprop_optimized_u8_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_i8816fprop_optimized_u8_objs.dir/generated/conv2d/75/i8816fprop_optimized_u8/cutlass_tensorop_i8816fprop_optimized_u8_256x128_64x2_nhwc_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Built target cutlass_library_conv2d_sm75_i8816fprop_optimized_u8_objs [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_i8832fprop_optimized_s4_objs.dir/generated/conv2d/75/i8832fprop_optimized_s4/all_sm75_i8832fprop_optimized_s4_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_i8832fprop_optimized_s4_objs.dir/generated/conv2d/75/i8832fprop_optimized_s4/cutlass_tensorop_i8832fprop_optimized_s4_256x128_128x2_nhwc_align32.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Built target cutlass_library_conv2d_sm75_i8832fprop_optimized_s4_objs [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_i8832fprop_optimized_u4_objs.dir/generated/conv2d/75/i8832fprop_optimized_u4/all_sm75_i8832fprop_optimized_u4_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_i8832fprop_optimized_u4_objs.dir/generated/conv2d/75/i8832fprop_optimized_u4/cutlass_tensorop_i8832fprop_optimized_u4_256x128_128x2_nhwc_align32.cu.o [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Built target cutlass_library_conv2d_sm75_i8832fprop_optimized_u4_objs [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s1688dgrad_optimized_f16_objs.dir/generated/conv2d/75/s1688dgrad_optimized_f16/all_sm75_s1688dgrad_optimized_f16_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s1688dgrad_optimized_f16_objs.dir/generated/conv2d/75/s1688dgrad_optimized_f16/cutlass_tensorop_s1688dgrad_optimized_f16_256x128_32x2_nhwc_unity_stride_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s1688dgrad_optimized_f16_objs.dir/generated/conv2d/75/s1688dgrad_optimized_f16/cutlass_tensorop_s1688dgrad_optimized_f16_256x128_32x2_nhwc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_tnn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Built target cutlass_library_conv2d_sm75_s1688dgrad_optimized_f16_objs [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s1688fprop_few_channels_f16_objs.dir/generated/conv2d/75/s1688fprop_few_channels_f16/all_sm75_s1688fprop_few_channels_f16_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s1688fprop_few_channels_f16_objs.dir/generated/conv2d/75/s1688fprop_few_channels_f16/cutlass_tensorop_s1688fprop_few_channels_f16_128x64_32x2_nhwc_align1.cu.o [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Built target cutlass_library_conv2d_sm75_s1688fprop_few_channels_f16_objs [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s1688fprop_fixed_channels_f16_objs.dir/generated/conv2d/75/s1688fprop_fixed_channels_f16/all_sm75_s1688fprop_fixed_channels_f16_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s1688fprop_fixed_channels_f16_objs.dir/generated/conv2d/75/s1688fprop_fixed_channels_f16/cutlass_tensorop_s1688fprop_fixed_channels_f16_128x64_32x2_nhwc_align4.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Built target cutlass_library_conv2d_sm75_s1688fprop_fixed_channels_f16_objs [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s1688fprop_optimized_f16_objs.dir/generated/conv2d/75/s1688fprop_optimized_f16/all_sm75_s1688fprop_optimized_f16_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s1688fprop_optimized_f16_objs.dir/generated/conv2d/75/s1688fprop_optimized_f16/cutlass_tensorop_s1688fprop_optimized_f16_256x128_32x2_nhwc_align8.cu.o [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s1688wgrad_optimized_f16_objs.dir/generated/conv2d/75/s1688wgrad_optimized_f16/all_sm75_s1688wgrad_optimized_f16_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s1688wgrad_optimized_f16_objs.dir/generated/conv2d/75/s1688wgrad_optimized_f16/cutlass_tensorop_s1688wgrad_optimized_f16_256x128_32x2_nhwc_align8.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Built target cutlass_library_conv2d_sm75_s1688fprop_optimized_f16_objs [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Built target cutlass_library_conv2d_sm75_s1688wgrad_optimized_f16_objs [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s4_i8832fprop_optimized_s4_objs.dir/generated/conv2d/75/s4_i8832fprop_optimized_s4/all_sm75_s4_i8832fprop_optimized_s4_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s4_i8832fprop_optimized_s4_objs.dir/generated/conv2d/75/s4_i8832fprop_optimized_s4/cutlass_tensorop_s4_i8832fprop_optimized_s4_256x128_128x2_nhwc_align32.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s4_i8832fprop_optimized_s4_objs.dir/generated/conv2d/75/s4_i8832fprop_optimized_s4/cutlass_tensorop_s4_i8832fprop_optimized_s4_256x128_128x2_nc64hw64_align32.cu.o [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o [ 73%] Built target cutlass_library_conv2d_sm75_s4_i8832fprop_optimized_s4_objs [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_f32_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu.o [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s8_i8816fprop_few_channels_s8_objs.dir/generated/conv2d/75/s8_i8816fprop_few_channels_s8/all_sm75_s8_i8816fprop_few_channels_s8_conv2d_operations.cu.o [ 73%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s8_i8816fprop_few_channels_s8_objs.dir/generated/conv2d/75/s8_i8816fprop_few_channels_s8/cutlass_tensorop_s8_i8816fprop_few_channels_s8_256x128_64x2_nhwc_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Built target cutlass_library_conv2d_sm75_s8_i8816fprop_few_channels_s8_objs [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_pingpong_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s8_i8816fprop_fixed_channels_s8_objs.dir/generated/conv2d/75/s8_i8816fprop_fixed_channels_s8/all_sm75_s8_i8816fprop_fixed_channels_s8_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s8_i8816fprop_fixed_channels_s8_objs.dir/generated/conv2d/75/s8_i8816fprop_fixed_channels_s8/cutlass_tensorop_s8_i8816fprop_fixed_channels_s8_256x128_64x2_nhwc_align16.cu.o [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs.dir/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_nosmem.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Built target cutlass_library_conv2d_sm75_s8_i8816fprop_fixed_channels_s8_objs [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s8_i8816fprop_optimized_s8_objs.dir/generated/conv2d/75/s8_i8816fprop_optimized_s8/all_sm75_s8_i8816fprop_optimized_s8_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s8_i8816fprop_optimized_s8_objs.dir/generated/conv2d/75/s8_i8816fprop_optimized_s8/cutlass_tensorop_s8_i8816fprop_optimized_s8_256x128_64x2_nhwc_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_s8_i8816fprop_optimized_s8_objs.dir/generated/conv2d/75/s8_i8816fprop_optimized_s8/cutlass_tensorop_s8_i8816fprop_optimized_s8_256x128_64x2_nc32hw32_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_u4_i8832fprop_optimized_u4_objs.dir/generated/conv2d/75/u4_i8832fprop_optimized_u4/all_sm75_u4_i8832fprop_optimized_u4_conv2d_operations.cu.o [ 74%] Built target cutlass_library_conv2d_sm75_s8_i8816fprop_optimized_s8_objs [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_u4_i8832fprop_optimized_u4_objs.dir/generated/conv2d/75/u4_i8832fprop_optimized_u4/cutlass_tensorop_u4_i8832fprop_optimized_u4_256x128_128x2_nhwc_align32.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_u8_i8816fprop_few_channels_u8_objs.dir/generated/conv2d/75/u8_i8816fprop_few_channels_u8/all_sm75_u8_i8816fprop_few_channels_u8_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::half_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::half_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_f16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_f16_f16_f32_void_f16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_u8_i8816fprop_few_channels_u8_objs.dir/generated/conv2d/75/u8_i8816fprop_few_channels_u8/cutlass_tensorop_u8_i8816fprop_few_channels_u8_256x128_64x2_nhwc_align16.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_u4_i8832fprop_optimized_u4_objs.dir/generated/conv2d/75/u4_i8832fprop_optimized_u4/cutlass_tensorop_u4_i8832fprop_optimized_u4_256x128_128x2_nc64hw64_align32.cu.o [ 74%] Built target cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_objs [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_u8_i8816fprop_fixed_channels_u8_objs.dir/generated/conv2d/75/u8_i8816fprop_fixed_channels_u8/all_sm75_u8_i8816fprop_fixed_channels_u8_conv2d_operations.cu.o /builddir/build/BUILD/cutlass/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp(279): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=uint8_t, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") return ElementEChunk{storage_}; ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "cutlass::transform::kernel::SM90StructuredSparseCompressor::ElementEChunk cutlass::transform::kernel::SM90StructuredSparseCompressor::MetadataOneChunk2to4::storage() const [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 433 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::structure_sparse_compress(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 220 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::run(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 214 instantiation of "void cutlass::transform::kernel::SM90StructuredSparseCompressor::operator()(cutlass::transform::kernel::SM90StructuredSparseCompressor::Params, void *) [with ProblemShape_=cute::tuple, ElementA_=cutlass::bfloat16_t, LayoutATag_=cutlass::layout::RowMajor, SparseConfig_=cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>]" at line 119 of /builddir/build/BUILD/cutlass/include/cutlass/device_kernel.h instantiation of "void cutlass::device_kernel(Operator::Params) [with Operator=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 148 of /builddir/build/BUILD/cutlass/include/cutlass/transform/device/transform_universal_adapter.hpp instantiation of "cutlass::Status cutlass::transform::device::TransformUniversalAdapter::initialize(const cutlass::transform::device::TransformUniversalAdapter::Arguments &, void *, cudaStream_t, cutlass::CudaHostAdapter *) [with TransformKernel_=cutlass::transform::kernel::SM90StructuredSparseCompressor, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::Sm90GemmSparseConfig, cute::SM90::GMMA::Major::K, cute::sparse_elem<8, uint8_t>, cute::C<64>>>]" at line 365 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::Status cutlass::library::SparseGemmUniversal3xOperation::initialize_with_profiler_workspace(const void *, void *, void *, uint8_t **, int, cudaStream_t) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp implicit generation of "cutlass::library::SparseGemmUniversal3xOperation::~SparseGemmUniversal3xOperation() noexcept [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of class "cutlass::library::SparseGemmUniversal3xOperation [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 108 of /builddir/build/BUILD/cutlass/tools/library/src/sparse_gemm_operation_3x.hpp instantiation of "cutlass::library::SparseGemmUniversal3xOperation::SparseGemmUniversal3xOperation(const char *) [with Operator_=cutlass::gemm::device::GemmUniversalAdapter]" at line 83 of /builddir/build/BUILD/cutlass/build/tools/library/generated/gemm/90/void_s64x128x32spgemm_bf16/cutlass3x_sm90_sptensorop_s64x128x32spgemm_bf16_bf16_f32_void_bf16_128x128x64_2x1x1_0_ttn_align16_stream_k_warpspecialized_cooperative_epi_tma.cu Remark: The warnings can be suppressed with "-diag-suppress " [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_u8_i8816fprop_fixed_channels_u8_objs.dir/generated/conv2d/75/u8_i8816fprop_fixed_channels_u8/cutlass_tensorop_u8_i8816fprop_fixed_channels_u8_256x128_64x2_nhwc_align16.cu.o [ 74%] Built target cutlass_library_conv2d_sm75_u8_i8816fprop_few_channels_u8_objs [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_u8_i8816fprop_optimized_u8_objs.dir/generated/conv2d/75/u8_i8816fprop_optimized_u8/all_sm75_u8_i8816fprop_optimized_u8_conv2d_operations.cu.o [ 74%] Built target cutlass_library_conv2d_sm75_u4_i8832fprop_optimized_u4_objs [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_u8_i8816fprop_optimized_u8_objs.dir/generated/conv2d/75/u8_i8816fprop_optimized_u8/cutlass_tensorop_u8_i8816fprop_optimized_u8_256x128_64x2_nhwc_align16.cu.o [ 74%] Built target cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_objs [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm75_u8_i8816fprop_optimized_u8_objs.dir/generated/conv2d/75/u8_i8816fprop_optimized_u8/cutlass_tensorop_u8_i8816fprop_optimized_u8_256x128_64x2_nc32hw32_align16.cu.o [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_bf16_s16816dgrad_optimized_bf16_objs.dir/generated/conv2d/80/bf16_s16816dgrad_optimized_bf16/all_sm80_bf16_s16816dgrad_optimized_bf16_conv2d_operations.cu.o [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_bf16_s16816dgrad_optimized_bf16_objs.dir/generated/conv2d/80/bf16_s16816dgrad_optimized_bf16/cutlass_tensorop_bf16_s16816dgrad_optimized_bf16_256x128_32x3_nhwc_unity_stride_align8.cu.o [ 74%] Built target cutlass_library_conv2d_sm75_u8_i8816fprop_fixed_channels_u8_objs [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_bf16_s16816dgrad_optimized_bf16_objs.dir/generated/conv2d/80/bf16_s16816dgrad_optimized_bf16/cutlass_tensorop_bf16_s16816dgrad_optimized_bf16_256x128_32x3_nhwc_align8.cu.o [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_bf16_s16816fprop_fixed_channels_bf16_objs.dir/generated/conv2d/80/bf16_s16816fprop_fixed_channels_bf16/all_sm80_bf16_s16816fprop_fixed_channels_bf16_conv2d_operations.cu.o [ 74%] Built target cutlass_library_conv2d_sm75_u8_i8816fprop_optimized_u8_objs [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_bf16_s16816fprop_fixed_channels_bf16_objs.dir/generated/conv2d/80/bf16_s16816fprop_fixed_channels_bf16/cutlass_tensorop_bf16_s16816fprop_fixed_channels_bf16_256x128_32x3_nhwc_align4.cu.o [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_bf16_s16816fprop_optimized_bf16_objs.dir/generated/conv2d/80/bf16_s16816fprop_optimized_bf16/all_sm80_bf16_s16816fprop_optimized_bf16_conv2d_operations.cu.o [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_bf16_s16816wgrad_optimized_bf16_objs.dir/generated/conv2d/80/bf16_s16816wgrad_optimized_bf16/all_sm80_bf16_s16816wgrad_optimized_bf16_conv2d_operations.cu.o [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_bf16_s16816fprop_optimized_bf16_objs.dir/generated/conv2d/80/bf16_s16816fprop_optimized_bf16/cutlass_tensorop_bf16_s16816fprop_optimized_bf16_256x128_32x3_nhwc_align8.cu.o [ 74%] Built target cutlass_library_conv2d_sm80_bf16_s16816dgrad_optimized_bf16_objs [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_bf16_s16816fprop_optimized_bf16_objs.dir/generated/conv2d/80/bf16_s16816fprop_optimized_bf16/cutlass_tensorop_bf16_s16816fprop_optimized_bf16_256x128_32x3_nhwc_single_group_align8.cu.o [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_bf16_s16816wgrad_optimized_bf16_objs.dir/generated/conv2d/80/bf16_s16816wgrad_optimized_bf16/cutlass_tensorop_bf16_s16816wgrad_optimized_bf16_256x128_32x3_nhwc_align8.cu.o [ 74%] Built target cutlass_library_conv2d_sm80_bf16_s16816fprop_fixed_channels_bf16_objs [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_f16_s16816dgrad_optimized_f16_objs.dir/generated/conv2d/80/f16_s16816dgrad_optimized_f16/all_sm80_f16_s16816dgrad_optimized_f16_conv2d_operations.cu.o [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_f16_s16816dgrad_optimized_f16_objs.dir/generated/conv2d/80/f16_s16816dgrad_optimized_f16/cutlass_tensorop_f16_s16816dgrad_optimized_f16_256x128_32x3_nhwc_unity_stride_align8.cu.o [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_f16_s16816dgrad_optimized_f16_objs.dir/generated/conv2d/80/f16_s16816dgrad_optimized_f16/cutlass_tensorop_f16_s16816dgrad_optimized_f16_256x128_32x3_nhwc_align8.cu.o [ 74%] Built target cutlass_library_conv2d_sm80_bf16_s16816fprop_optimized_bf16_objs [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_f16_s16816fprop_fixed_channels_f16_objs.dir/generated/conv2d/80/f16_s16816fprop_fixed_channels_f16/all_sm80_f16_s16816fprop_fixed_channels_f16_conv2d_operations.cu.o [ 74%] Built target cutlass_library_conv2d_sm80_bf16_s16816wgrad_optimized_bf16_objs [ 74%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_f16_s16816fprop_fixed_channels_f16_objs.dir/generated/conv2d/80/f16_s16816fprop_fixed_channels_f16/cutlass_tensorop_f16_s16816fprop_fixed_channels_f16_256x128_32x3_nhwc_align4.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_f16_s16816fprop_optimized_f16_objs.dir/generated/conv2d/80/f16_s16816fprop_optimized_f16/all_sm80_f16_s16816fprop_optimized_f16_conv2d_operations.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_f16_s16816wgrad_optimized_f16_objs.dir/generated/conv2d/80/f16_s16816wgrad_optimized_f16/all_sm80_f16_s16816wgrad_optimized_f16_conv2d_operations.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_f16_s16816fprop_optimized_f16_objs.dir/generated/conv2d/80/f16_s16816fprop_optimized_f16/cutlass_tensorop_f16_s16816fprop_optimized_f16_256x128_32x3_nhwc_align8.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_f16_s16816dgrad_optimized_f16_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_f16_s16816fprop_optimized_f16_objs.dir/generated/conv2d/80/f16_s16816fprop_optimized_f16/cutlass_tensorop_f16_s16816fprop_optimized_f16_256x128_32x3_nhwc_single_group_align8.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_f16_s16816wgrad_optimized_f16_objs.dir/generated/conv2d/80/f16_s16816wgrad_optimized_f16/cutlass_tensorop_f16_s16816wgrad_optimized_f16_256x128_32x3_nhwc_align8.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_f16_s16816fprop_fixed_channels_f16_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_h16816dgrad_optimized_objs.dir/generated/conv2d/80/h16816dgrad_optimized/all_sm80_h16816dgrad_optimized_conv2d_operations.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_h16816dgrad_optimized_objs.dir/generated/conv2d/80/h16816dgrad_optimized/cutlass_tensorop_h16816dgrad_optimized_256x128_32x3_nhwc_unity_stride_align8.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_h16816fprop_fixed_channels_objs.dir/generated/conv2d/80/h16816fprop_fixed_channels/all_sm80_h16816fprop_fixed_channels_conv2d_operations.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_f16_s16816fprop_optimized_f16_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_h16816fprop_fixed_channels_objs.dir/generated/conv2d/80/h16816fprop_fixed_channels/cutlass_tensorop_h16816fprop_fixed_channels_256x128_32x3_nhwc_align4.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_f16_s16816wgrad_optimized_f16_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_h16816dgrad_optimized_objs.dir/generated/conv2d/80/h16816dgrad_optimized/cutlass_tensorop_h16816dgrad_optimized_256x128_32x3_nhwc_align8.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_h16816fprop_optimized_objs.dir/generated/conv2d/80/h16816fprop_optimized/all_sm80_h16816fprop_optimized_conv2d_operations.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_h16816fprop_optimized_objs.dir/generated/conv2d/80/h16816fprop_optimized/cutlass_tensorop_h16816fprop_optimized_256x128_32x3_nhwc_align8.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_h16816wgrad_optimized_objs.dir/generated/conv2d/80/h16816wgrad_optimized/all_sm80_h16816wgrad_optimized_conv2d_operations.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_h16816fprop_fixed_channels_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_h16816fprop_optimized_objs.dir/generated/conv2d/80/h16816fprop_optimized/cutlass_tensorop_h16816fprop_optimized_256x128_32x3_nhwc_single_group_align8.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_h16816wgrad_optimized_objs.dir/generated/conv2d/80/h16816wgrad_optimized/cutlass_tensorop_h16816wgrad_optimized_256x128_32x3_nhwc_align8.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_h16816dgrad_optimized_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_i16832fprop_optimized_s8_objs.dir/generated/conv2d/80/i16832fprop_optimized_s8/all_sm80_i16832fprop_optimized_s8_conv2d_operations.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_i16832fprop_optimized_s8_objs.dir/generated/conv2d/80/i16832fprop_optimized_s8/cutlass_tensorop_i16832fprop_optimized_s8_256x128_64x3_nhwc_align16.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_i16832fprop_optimized_u8_objs.dir/generated/conv2d/80/i16832fprop_optimized_u8/all_sm80_i16832fprop_optimized_u8_conv2d_operations.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_h16816fprop_optimized_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_i16832fprop_optimized_u8_objs.dir/generated/conv2d/80/i16832fprop_optimized_u8/cutlass_tensorop_i16832fprop_optimized_u8_256x128_64x3_nhwc_align16.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_h16816wgrad_optimized_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_i16832fprop_optimized_u8_objs.dir/generated/conv2d/80/i16832fprop_optimized_u8/cutlass_tensorop_i16832fprop_optimized_u8_256x128_64x3_nhwc_single_group_align16.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_i16832fprop_optimized_s8_objs.dir/generated/conv2d/80/i16832fprop_optimized_s8/cutlass_tensorop_i16832fprop_optimized_s8_256x128_64x3_nhwc_single_group_align16.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_i16864fprop_optimized_s4_objs.dir/generated/conv2d/80/i16864fprop_optimized_s4/all_sm80_i16864fprop_optimized_s4_conv2d_operations.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_i16864fprop_optimized_s4_objs.dir/generated/conv2d/80/i16864fprop_optimized_s4/cutlass_tensorop_i16864fprop_optimized_s4_256x128_128x3_nhwc_align32.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_i16864fprop_optimized_u4_objs.dir/generated/conv2d/80/i16864fprop_optimized_u4/all_sm80_i16864fprop_optimized_u4_conv2d_operations.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_i16832fprop_optimized_u8_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_i16864fprop_optimized_s4_objs.dir/generated/conv2d/80/i16864fprop_optimized_s4/cutlass_tensorop_i16864fprop_optimized_s4_256x128_128x3_nhwc_single_group_align32.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_i16832fprop_optimized_s8_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_i16864fprop_optimized_u4_objs.dir/generated/conv2d/80/i16864fprop_optimized_u4/cutlass_tensorop_i16864fprop_optimized_u4_256x128_128x3_nhwc_align32.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816dgrad_optimized_bf16_objs.dir/generated/conv2d/80/s16816dgrad_optimized_bf16/all_sm80_s16816dgrad_optimized_bf16_conv2d_operations.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816dgrad_optimized_bf16_objs.dir/generated/conv2d/80/s16816dgrad_optimized_bf16/cutlass_tensorop_s16816dgrad_optimized_bf16_256x128_32x3_nhwc_unity_stride_align8.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816dgrad_optimized_bf16_objs.dir/generated/conv2d/80/s16816dgrad_optimized_bf16/cutlass_tensorop_s16816dgrad_optimized_bf16_256x128_32x3_nhwc_align8.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_i16864fprop_optimized_s4_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_i16864fprop_optimized_u4_objs.dir/generated/conv2d/80/i16864fprop_optimized_u4/cutlass_tensorop_i16864fprop_optimized_u4_256x128_128x3_nhwc_single_group_align32.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816dgrad_optimized_f16_objs.dir/generated/conv2d/80/s16816dgrad_optimized_f16/all_sm80_s16816dgrad_optimized_f16_conv2d_operations.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816dgrad_optimized_f16_objs.dir/generated/conv2d/80/s16816dgrad_optimized_f16/cutlass_tensorop_s16816dgrad_optimized_f16_256x128_32x3_nhwc_unity_stride_align8.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816fprop_fixed_channels_bf16_objs.dir/generated/conv2d/80/s16816fprop_fixed_channels_bf16/all_sm80_s16816fprop_fixed_channels_bf16_conv2d_operations.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_i16864fprop_optimized_u4_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816dgrad_optimized_f16_objs.dir/generated/conv2d/80/s16816dgrad_optimized_f16/cutlass_tensorop_s16816dgrad_optimized_f16_256x128_32x3_nhwc_align8.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_s16816dgrad_optimized_bf16_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816fprop_fixed_channels_bf16_objs.dir/generated/conv2d/80/s16816fprop_fixed_channels_bf16/cutlass_tensorop_s16816fprop_fixed_channels_bf16_256x128_32x3_nhwc_align4.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816fprop_fixed_channels_f16_objs.dir/generated/conv2d/80/s16816fprop_fixed_channels_f16/all_sm80_s16816fprop_fixed_channels_f16_conv2d_operations.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816fprop_optimized_bf16_objs.dir/generated/conv2d/80/s16816fprop_optimized_bf16/all_sm80_s16816fprop_optimized_bf16_conv2d_operations.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816fprop_fixed_channels_f16_objs.dir/generated/conv2d/80/s16816fprop_fixed_channels_f16/cutlass_tensorop_s16816fprop_fixed_channels_f16_256x128_32x3_nhwc_align4.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816fprop_optimized_bf16_objs.dir/generated/conv2d/80/s16816fprop_optimized_bf16/cutlass_tensorop_s16816fprop_optimized_bf16_256x128_32x3_nhwc_align8.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_s16816dgrad_optimized_f16_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816fprop_optimized_bf16_objs.dir/generated/conv2d/80/s16816fprop_optimized_bf16/cutlass_tensorop_s16816fprop_optimized_bf16_256x128_32x3_nhwc_single_group_align8.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_s16816fprop_fixed_channels_bf16_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816fprop_optimized_f16_objs.dir/generated/conv2d/80/s16816fprop_optimized_f16/all_sm80_s16816fprop_optimized_f16_conv2d_operations.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816fprop_optimized_f16_objs.dir/generated/conv2d/80/s16816fprop_optimized_f16/cutlass_tensorop_s16816fprop_optimized_f16_256x128_32x3_nhwc_align8.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_s16816fprop_fixed_channels_f16_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816fprop_optimized_f16_objs.dir/generated/conv2d/80/s16816fprop_optimized_f16/cutlass_tensorop_s16816fprop_optimized_f16_256x128_32x3_nhwc_single_group_align8.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816wgrad_optimized_bf16_objs.dir/generated/conv2d/80/s16816wgrad_optimized_bf16/all_sm80_s16816wgrad_optimized_bf16_conv2d_operations.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_s16816fprop_optimized_bf16_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816wgrad_optimized_bf16_objs.dir/generated/conv2d/80/s16816wgrad_optimized_bf16/cutlass_tensorop_s16816wgrad_optimized_bf16_256x128_32x3_nhwc_align8.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816wgrad_optimized_f16_objs.dir/generated/conv2d/80/s16816wgrad_optimized_f16/all_sm80_s16816wgrad_optimized_f16_conv2d_operations.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688bf16dgrad_optimized_objs.dir/generated/conv2d/80/s1688bf16dgrad_optimized/all_sm80_s1688bf16dgrad_optimized_conv2d_operations.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s16816wgrad_optimized_f16_objs.dir/generated/conv2d/80/s16816wgrad_optimized_f16/cutlass_tensorop_s16816wgrad_optimized_f16_256x128_32x3_nhwc_align8.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_s16816fprop_optimized_f16_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688bf16dgrad_optimized_objs.dir/generated/conv2d/80/s1688bf16dgrad_optimized/cutlass_tensorop_s1688bf16dgrad_optimized_256x128_16x3_nhwc_unity_stride_align4.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688bf16fprop_optimized_objs.dir/generated/conv2d/80/s1688bf16fprop_optimized/all_sm80_s1688bf16fprop_optimized_conv2d_operations.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_s16816wgrad_optimized_bf16_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688bf16fprop_optimized_objs.dir/generated/conv2d/80/s1688bf16fprop_optimized/cutlass_tensorop_s1688bf16fprop_optimized_256x128_16x3_nhwc_align4.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688bf16wgrad_optimized_objs.dir/generated/conv2d/80/s1688bf16wgrad_optimized/all_sm80_s1688bf16wgrad_optimized_conv2d_operations.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_s16816wgrad_optimized_f16_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688bf16dgrad_optimized_objs.dir/generated/conv2d/80/s1688bf16dgrad_optimized/cutlass_tensorop_s1688bf16dgrad_optimized_256x128_16x3_nhwc_align4.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688bf16fprop_optimized_objs.dir/generated/conv2d/80/s1688bf16fprop_optimized/cutlass_tensorop_s1688bf16fprop_optimized_256x128_16x3_nhwc_single_group_align4.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688bf16wgrad_optimized_objs.dir/generated/conv2d/80/s1688bf16wgrad_optimized/cutlass_tensorop_s1688bf16wgrad_optimized_256x128_16x3_nhwc_align4.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688dgrad_optimized_objs.dir/generated/conv2d/80/s1688dgrad_optimized/all_sm80_s1688dgrad_optimized_conv2d_operations.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688dgrad_optimized_objs.dir/generated/conv2d/80/s1688dgrad_optimized/cutlass_tensorop_s1688dgrad_optimized_128x128_16x4_nhwc_unity_stride_align4.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_s1688bf16fprop_optimized_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688dgrad_optimized_objs.dir/generated/conv2d/80/s1688dgrad_optimized/cutlass_tensorop_s1688dgrad_optimized_128x128_16x4_nhwc_align4.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_s1688bf16dgrad_optimized_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688dgrad_optimized_tf32_objs.dir/generated/conv2d/80/s1688dgrad_optimized_tf32/all_sm80_s1688dgrad_optimized_tf32_conv2d_operations.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_s1688bf16wgrad_optimized_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688dgrad_optimized_tf32_objs.dir/generated/conv2d/80/s1688dgrad_optimized_tf32/cutlass_tensorop_s1688dgrad_optimized_tf32_256x128_16x3_nhwc_unity_stride_align4.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688f16dgrad_optimized_objs.dir/generated/conv2d/80/s1688f16dgrad_optimized/all_sm80_s1688f16dgrad_optimized_conv2d_operations.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688f16dgrad_optimized_objs.dir/generated/conv2d/80/s1688f16dgrad_optimized/cutlass_tensorop_s1688f16dgrad_optimized_256x128_16x3_nhwc_unity_stride_align4.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688dgrad_optimized_tf32_objs.dir/generated/conv2d/80/s1688dgrad_optimized_tf32/cutlass_tensorop_s1688dgrad_optimized_tf32_256x128_16x3_nhwc_align4.cu.o [ 75%] Built target cutlass_library_conv2d_sm80_s1688dgrad_optimized_objs [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688f16dgrad_optimized_objs.dir/generated/conv2d/80/s1688f16dgrad_optimized/cutlass_tensorop_s1688f16dgrad_optimized_256x128_16x3_nhwc_align4.cu.o [ 75%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688f16fprop_optimized_objs.dir/generated/conv2d/80/s1688f16fprop_optimized/all_sm80_s1688f16fprop_optimized_conv2d_operations.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688f16fprop_optimized_objs.dir/generated/conv2d/80/s1688f16fprop_optimized/cutlass_tensorop_s1688f16fprop_optimized_256x128_16x3_nhwc_align4.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688f16wgrad_optimized_objs.dir/generated/conv2d/80/s1688f16wgrad_optimized/all_sm80_s1688f16wgrad_optimized_conv2d_operations.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688f16wgrad_optimized_objs.dir/generated/conv2d/80/s1688f16wgrad_optimized/cutlass_tensorop_s1688f16wgrad_optimized_256x128_16x3_nhwc_align4.cu.o [ 76%] Built target cutlass_library_conv2d_sm80_s1688dgrad_optimized_tf32_objs [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688f16fprop_optimized_objs.dir/generated/conv2d/80/s1688f16fprop_optimized/cutlass_tensorop_s1688f16fprop_optimized_256x128_16x3_nhwc_single_group_align4.cu.o [ 76%] Built target cutlass_library_conv2d_sm80_s1688f16dgrad_optimized_objs [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688fprop_optimized_objs.dir/generated/conv2d/80/s1688fprop_optimized/all_sm80_s1688fprop_optimized_conv2d_operations.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688fprop_optimized_objs.dir/generated/conv2d/80/s1688fprop_optimized/cutlass_tensorop_s1688fprop_optimized_128x128_16x4_nhwc_align4.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688fprop_optimized_tf32_objs.dir/generated/conv2d/80/s1688fprop_optimized_tf32/all_sm80_s1688fprop_optimized_tf32_conv2d_operations.cu.o [ 76%] Built target cutlass_library_conv2d_sm80_s1688f16wgrad_optimized_objs [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688fprop_optimized_tf32_objs.dir/generated/conv2d/80/s1688fprop_optimized_tf32/cutlass_tensorop_s1688fprop_optimized_tf32_256x128_16x3_nhwc_align4.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688tf32dgrad_optimized_objs.dir/generated/conv2d/80/s1688tf32dgrad_optimized/all_sm80_s1688tf32dgrad_optimized_conv2d_operations.cu.o [ 76%] Built target cutlass_library_conv2d_sm80_s1688f16fprop_optimized_objs [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688tf32dgrad_optimized_objs.dir/generated/conv2d/80/s1688tf32dgrad_optimized/cutlass_tensorop_s1688tf32dgrad_optimized_256x128_16x3_nhwc_unity_stride_align4.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688fprop_optimized_objs.dir/generated/conv2d/80/s1688fprop_optimized/cutlass_tensorop_s1688fprop_optimized_128x128_16x4_nhwc_single_group_align4.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688fprop_optimized_tf32_objs.dir/generated/conv2d/80/s1688fprop_optimized_tf32/cutlass_tensorop_s1688fprop_optimized_tf32_256x128_16x3_nhwc_single_group_align4.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688tf32fprop_optimized_objs.dir/generated/conv2d/80/s1688tf32fprop_optimized/all_sm80_s1688tf32fprop_optimized_conv2d_operations.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688tf32dgrad_optimized_objs.dir/generated/conv2d/80/s1688tf32dgrad_optimized/cutlass_tensorop_s1688tf32dgrad_optimized_256x128_16x3_nhwc_align4.cu.o [ 76%] Built target cutlass_library_conv2d_sm80_s1688fprop_optimized_objs [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688tf32fprop_optimized_objs.dir/generated/conv2d/80/s1688tf32fprop_optimized/cutlass_tensorop_s1688tf32fprop_optimized_256x128_16x3_nhwc_align4.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688tf32wgrad_optimized_objs.dir/generated/conv2d/80/s1688tf32wgrad_optimized/all_sm80_s1688tf32wgrad_optimized_conv2d_operations.cu.o [ 76%] Built target cutlass_library_conv2d_sm80_s1688fprop_optimized_tf32_objs [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688tf32wgrad_optimized_objs.dir/generated/conv2d/80/s1688tf32wgrad_optimized/cutlass_tensorop_s1688tf32wgrad_optimized_256x128_16x3_nhwc_align4.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688wgrad_optimized_objs.dir/generated/conv2d/80/s1688wgrad_optimized/all_sm80_s1688wgrad_optimized_conv2d_operations.cu.o [ 76%] Built target cutlass_library_conv2d_sm80_s1688tf32dgrad_optimized_objs [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688tf32fprop_optimized_objs.dir/generated/conv2d/80/s1688tf32fprop_optimized/cutlass_tensorop_s1688tf32fprop_optimized_256x128_16x3_nhwc_single_group_align4.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688wgrad_optimized_objs.dir/generated/conv2d/80/s1688wgrad_optimized/cutlass_tensorop_s1688wgrad_optimized_128x128_16x4_nhwc_align4.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688wgrad_optimized_tf32_objs.dir/generated/conv2d/80/s1688wgrad_optimized_tf32/all_sm80_s1688wgrad_optimized_tf32_conv2d_operations.cu.o [ 76%] Built target cutlass_library_conv2d_sm80_s1688tf32wgrad_optimized_objs [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s1688wgrad_optimized_tf32_objs.dir/generated/conv2d/80/s1688wgrad_optimized_tf32/cutlass_tensorop_s1688wgrad_optimized_tf32_256x128_16x3_nhwc_align4.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s4_i16864fprop_optimized_s4_objs.dir/generated/conv2d/80/s4_i16864fprop_optimized_s4/all_sm80_s4_i16864fprop_optimized_s4_conv2d_operations.cu.o [ 76%] Built target cutlass_library_conv2d_sm80_s1688wgrad_optimized_objs [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s4_i16864fprop_optimized_s4_objs.dir/generated/conv2d/80/s4_i16864fprop_optimized_s4/cutlass_tensorop_s4_i16864fprop_optimized_s4_256x128_128x3_nhwc_align32.cu.o [ 76%] Built target cutlass_library_conv2d_sm80_s1688tf32fprop_optimized_objs [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s4_i16864fprop_optimized_s4_objs.dir/generated/conv2d/80/s4_i16864fprop_optimized_s4/cutlass_tensorop_s4_i16864fprop_optimized_s4_256x128_128x3_nhwc_single_group_align32.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s8_i16832fprop_few_channels_s8_objs.dir/generated/conv2d/80/s8_i16832fprop_few_channels_s8/all_sm80_s8_i16832fprop_few_channels_s8_conv2d_operations.cu.o [ 76%] Built target cutlass_library_conv2d_sm80_s1688wgrad_optimized_tf32_objs [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s8_i16832fprop_few_channels_s8_objs.dir/generated/conv2d/80/s8_i16832fprop_few_channels_s8/cutlass_tensorop_s8_i16832fprop_few_channels_s8_256x128_64x3_nhwc_align16.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s4_i16864fprop_optimized_s4_objs.dir/generated/conv2d/80/s4_i16864fprop_optimized_s4/cutlass_tensorop_s4_i16864fprop_optimized_s4_256x128_128x3_nc64hw64_align32.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s8_i16832fprop_fixed_channels_s8_objs.dir/generated/conv2d/80/s8_i16832fprop_fixed_channels_s8/all_sm80_s8_i16832fprop_fixed_channels_s8_conv2d_operations.cu.o [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s8_i16832fprop_optimized_s8_objs.dir/generated/conv2d/80/s8_i16832fprop_optimized_s8/all_sm80_s8_i16832fprop_optimized_s8_conv2d_operations.cu.o [ 76%] Built target cutlass_library_conv2d_sm80_s8_i16832fprop_few_channels_s8_objs [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s8_i16832fprop_optimized_s8_objs.dir/generated/conv2d/80/s8_i16832fprop_optimized_s8/cutlass_tensorop_s8_i16832fprop_optimized_s8_256x128_64x3_nhwc_align16.cu.o [ 76%] Built target cutlass_library_conv2d_sm80_s4_i16864fprop_optimized_s4_objs [ 76%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s8_i16832fprop_fixed_channels_s8_objs.dir/generated/conv2d/80/s8_i16832fprop_fixed_channels_s8/cutlass_tensorop_s8_i16832fprop_fixed_channels_s8_256x128_64x3_nhwc_align16.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s8_i16832fprop_optimized_s8_objs.dir/generated/conv2d/80/s8_i16832fprop_optimized_s8/cutlass_tensorop_s8_i16832fprop_optimized_s8_256x128_64x3_nhwc_single_group_align16.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_sdgrad_optimized_objs.dir/generated/conv2d/80/sdgrad_optimized/all_sm80_sdgrad_optimized_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_sdgrad_optimized_objs.dir/generated/conv2d/80/sdgrad_optimized/cutlass_simt_sdgrad_optimized_256x128_8x5_nhwc_unity_stride_align1.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_sdgrad_optimized_objs.dir/generated/conv2d/80/sdgrad_optimized/cutlass_simt_sdgrad_optimized_256x128_8x5_nhwc_align1.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_s8_i16832fprop_optimized_s8_objs.dir/generated/conv2d/80/s8_i16832fprop_optimized_s8/cutlass_tensorop_s8_i16832fprop_optimized_s8_256x128_64x3_nc32hw32_align16.cu.o [ 77%] Built target cutlass_library_conv2d_sm80_s8_i16832fprop_fixed_channels_s8_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_sfprop_optimized_objs.dir/generated/conv2d/80/sfprop_optimized/all_sm80_sfprop_optimized_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_sfprop_optimized_objs.dir/generated/conv2d/80/sfprop_optimized/cutlass_simt_sfprop_optimized_256x128_8x5_nhwc_align1.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_swgrad_optimized_objs.dir/generated/conv2d/80/swgrad_optimized/all_sm80_swgrad_optimized_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm80_s8_i16832fprop_optimized_s8_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_swgrad_optimized_objs.dir/generated/conv2d/80/swgrad_optimized/cutlass_simt_swgrad_optimized_256x128_8x5_nhwc_align1.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_tf32_s1688dgrad_optimized_tf32_objs.dir/generated/conv2d/80/tf32_s1688dgrad_optimized_tf32/all_sm80_tf32_s1688dgrad_optimized_tf32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_tf32_s1688dgrad_optimized_tf32_objs.dir/generated/conv2d/80/tf32_s1688dgrad_optimized_tf32/cutlass_tensorop_tf32_s1688dgrad_optimized_tf32_256x128_16x3_nhwc_unity_stride_align4.cu.o [ 77%] Built target cutlass_library_conv2d_sm80_sdgrad_optimized_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_tf32_s1688dgrad_optimized_tf32_objs.dir/generated/conv2d/80/tf32_s1688dgrad_optimized_tf32/cutlass_tensorop_tf32_s1688dgrad_optimized_tf32_256x128_16x3_nhwc_align4.cu.o [ 77%] Built target cutlass_library_conv2d_sm80_sfprop_optimized_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_tf32_s1688fprop_optimized_tf32_objs.dir/generated/conv2d/80/tf32_s1688fprop_optimized_tf32/all_sm80_tf32_s1688fprop_optimized_tf32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_tf32_s1688fprop_optimized_tf32_objs.dir/generated/conv2d/80/tf32_s1688fprop_optimized_tf32/cutlass_tensorop_tf32_s1688fprop_optimized_tf32_256x128_16x3_nhwc_align4.cu.o [ 77%] Built target cutlass_library_conv2d_sm80_swgrad_optimized_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_tf32_s1688fprop_optimized_tf32_objs.dir/generated/conv2d/80/tf32_s1688fprop_optimized_tf32/cutlass_tensorop_tf32_s1688fprop_optimized_tf32_256x128_16x3_nhwc_single_group_align4.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_tf32_s1688wgrad_optimized_tf32_objs.dir/generated/conv2d/80/tf32_s1688wgrad_optimized_tf32/all_sm80_tf32_s1688wgrad_optimized_tf32_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm80_tf32_s1688dgrad_optimized_tf32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_tf32_s1688wgrad_optimized_tf32_objs.dir/generated/conv2d/80/tf32_s1688wgrad_optimized_tf32/cutlass_tensorop_tf32_s1688wgrad_optimized_tf32_256x128_16x3_nhwc_align4.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_u4_i16864fprop_optimized_u4_objs.dir/generated/conv2d/80/u4_i16864fprop_optimized_u4/all_sm80_u4_i16864fprop_optimized_u4_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_u4_i16864fprop_optimized_u4_objs.dir/generated/conv2d/80/u4_i16864fprop_optimized_u4/cutlass_tensorop_u4_i16864fprop_optimized_u4_256x128_128x3_nhwc_align32.cu.o [ 77%] Built target cutlass_library_conv2d_sm80_tf32_s1688fprop_optimized_tf32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_u4_i16864fprop_optimized_u4_objs.dir/generated/conv2d/80/u4_i16864fprop_optimized_u4/cutlass_tensorop_u4_i16864fprop_optimized_u4_256x128_128x3_nhwc_single_group_align32.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_u8_i16832fprop_few_channels_u8_objs.dir/generated/conv2d/80/u8_i16832fprop_few_channels_u8/all_sm80_u8_i16832fprop_few_channels_u8_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_u8_i16832fprop_few_channels_u8_objs.dir/generated/conv2d/80/u8_i16832fprop_few_channels_u8/cutlass_tensorop_u8_i16832fprop_few_channels_u8_256x128_64x3_nhwc_align16.cu.o [ 77%] Built target cutlass_library_conv2d_sm80_tf32_s1688wgrad_optimized_tf32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_u8_i16832fprop_fixed_channels_u8_objs.dir/generated/conv2d/80/u8_i16832fprop_fixed_channels_u8/all_sm80_u8_i16832fprop_fixed_channels_u8_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_u8_i16832fprop_optimized_u8_objs.dir/generated/conv2d/80/u8_i16832fprop_optimized_u8/all_sm80_u8_i16832fprop_optimized_u8_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_u4_i16864fprop_optimized_u4_objs.dir/generated/conv2d/80/u4_i16864fprop_optimized_u4/cutlass_tensorop_u4_i16864fprop_optimized_u4_256x128_128x3_nc64hw64_align32.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_u8_i16832fprop_fixed_channels_u8_objs.dir/generated/conv2d/80/u8_i16832fprop_fixed_channels_u8/cutlass_tensorop_u8_i16832fprop_fixed_channels_u8_256x128_64x3_nhwc_align16.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_u8_i16832fprop_optimized_u8_objs.dir/generated/conv2d/80/u8_i16832fprop_optimized_u8/cutlass_tensorop_u8_i16832fprop_optimized_u8_256x128_64x3_nhwc_align16.cu.o [ 77%] Built target cutlass_library_conv2d_sm80_u8_i16832fprop_few_channels_u8_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_u8_i16832fprop_optimized_u8_objs.dir/generated/conv2d/80/u8_i16832fprop_optimized_u8/cutlass_tensorop_u8_i16832fprop_optimized_u8_256x128_64x3_nhwc_single_group_align16.cu.o [ 77%] Built target cutlass_library_conv2d_sm80_u4_i16864fprop_optimized_u4_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm80_u8_i16832fprop_optimized_u8_objs.dir/generated/conv2d/80/u8_i16832fprop_optimized_u8/cutlass_tensorop_u8_i16832fprop_optimized_u8_256x128_64x3_nc32hw32_align16.cu.o [ 77%] Built target cutlass_library_conv2d_sm80_u8_i16832fprop_fixed_channels_u8_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_128x128x64_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm80_u8_i16832fprop_optimized_u8_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_128x128x64_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_128x128x32_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_128x128x32_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16_128x192x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16_128x192x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_128x256x64_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16_128x256x64_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_128x256x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16_128x256x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_128x256x32_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16_128x256x32_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_128x256x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16_128x256x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_256x128x64_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_256x128x64_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_256x128x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_256x128x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_256x128x32_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_256x128x32_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_256x128x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_256x128x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_256x64x64_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16_256x64x64_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_256x64x32_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16_256x64x32_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16_256x96x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16_256x96x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_64x128x64_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_64x128x64_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_64x128x32_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_64x128x32_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_64x256x64_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16_64x256x64_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_64x256x32_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16_64x256x32_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_64x64x64_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16_64x64x64_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_64x64x32_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16/all_sm90_f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs.dir/generated/conv2d/90/f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16/cutlass3x_sm90_tensorop_f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16_64x64x32_1x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32/all_sm90_f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32_128x192x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32/all_sm90_f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_128x256x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32/all_sm90_f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32/all_sm90_f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32_128x256x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_128x256x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32/all_sm90_f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32_128x256x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32/all_sm90_f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32_128x256x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32/all_sm90_f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32/all_sm90_f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_256x128x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32_256x128x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32/all_sm90_f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_256x128x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32/all_sm90_f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32_256x128x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32/all_sm90_f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32/all_sm90_f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32_256x128x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32_256x96x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32/all_sm90_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32_64x64x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32_64x64x64_1x2x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32/all_sm90_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32/all_sm90_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32_64x64x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32_64x64x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32_64x64x64_1x2x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs.dir/generated/conv2d/90/f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32_64x64x32_1x2x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32_objs.dir/generated/conv2d/90/s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32/all_sm90_s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32_objs.dir/generated/conv2d/90/s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32/cutlass3x_sm90_tensorop_s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32_128x256x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32_objs.dir/generated/conv2d/90/s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32/all_sm90_s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32_conv2d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32_objs.dir/generated/conv2d/90/s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32/cutlass3x_sm90_tensorop_s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32_128x256x128_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32_objs.dir/generated/conv2d/90/s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32/all_sm90_s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32_objs.dir/generated/conv2d/90/s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32/all_sm90_s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32_objs.dir/generated/conv2d/90/s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32/cutlass3x_sm90_tensorop_s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32_256x128x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32_objs.dir/generated/conv2d/90/s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32/cutlass3x_sm90_tensorop_s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32_256x128x128_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32_objs.dir/generated/conv2d/90/s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32/all_sm90_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32_conv2d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32_objs.dir/generated/conv2d/90/s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32/cutlass3x_sm90_tensorop_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32_64x64x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv2d_sm90_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32_objs.dir/generated/conv2d/90/s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32/cutlass3x_sm90_tensorop_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32_64x64x64_1x2x1_0_align16_warpspecialized_epi_tma.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_bf16_s16816dgrad3d_analytic_bf16_objs.dir/generated/conv3d/80/bf16_s16816dgrad3d_analytic_bf16/all_sm80_bf16_s16816dgrad3d_analytic_bf16_conv3d_operations.cu.o [ 77%] Built target cutlass_library_conv2d_sm90_s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_bf16_s16816dgrad3d_analytic_bf16_objs.dir/generated/conv3d/80/bf16_s16816dgrad3d_analytic_bf16/cutlass_tensorop_bf16_s16816dgrad3d_analytic_bf16_256x128_32x3.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_bf16_s16816dgrad3d_optimized_bf16_objs.dir/generated/conv3d/80/bf16_s16816dgrad3d_optimized_bf16/all_sm80_bf16_s16816dgrad3d_optimized_bf16_conv3d_operations.cu.o [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_bf16_s16816dgrad3d_optimized_bf16_objs.dir/generated/conv3d/80/bf16_s16816dgrad3d_optimized_bf16/cutlass_tensorop_bf16_s16816dgrad3d_optimized_bf16_256x128_32x3_unity_stride.cu.o [ 77%] Built target cutlass_library_conv3d_sm80_bf16_s16816dgrad3d_analytic_bf16_objs [ 77%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_bf16_s16816fprop3d_optimized_bf16_objs.dir/generated/conv3d/80/bf16_s16816fprop3d_optimized_bf16/all_sm80_bf16_s16816fprop3d_optimized_bf16_conv3d_operations.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_bf16_s16816wgrad3d_optimized_bf16_objs.dir/generated/conv3d/80/bf16_s16816wgrad3d_optimized_bf16/all_sm80_bf16_s16816wgrad3d_optimized_bf16_conv3d_operations.cu.o [ 78%] Built target cutlass_library_conv2d_sm90_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32_objs [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_bf16_s16816fprop3d_optimized_bf16_objs.dir/generated/conv3d/80/bf16_s16816fprop3d_optimized_bf16/cutlass_tensorop_bf16_s16816fprop3d_optimized_bf16_256x128_32x3.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_bf16_s16816wgrad3d_optimized_bf16_objs.dir/generated/conv3d/80/bf16_s16816wgrad3d_optimized_bf16/cutlass_tensorop_bf16_s16816wgrad3d_optimized_bf16_256x128_32x3.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_f16_s16816dgrad3d_analytic_f16_objs.dir/generated/conv3d/80/f16_s16816dgrad3d_analytic_f16/all_sm80_f16_s16816dgrad3d_analytic_f16_conv3d_operations.cu.o [ 78%] Built target cutlass_library_conv3d_sm80_bf16_s16816dgrad3d_optimized_bf16_objs [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_f16_s16816dgrad3d_optimized_f16_objs.dir/generated/conv3d/80/f16_s16816dgrad3d_optimized_f16/all_sm80_f16_s16816dgrad3d_optimized_f16_conv3d_operations.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_f16_s16816dgrad3d_analytic_f16_objs.dir/generated/conv3d/80/f16_s16816dgrad3d_analytic_f16/cutlass_tensorop_f16_s16816dgrad3d_analytic_f16_256x128_32x3.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_f16_s16816dgrad3d_optimized_f16_objs.dir/generated/conv3d/80/f16_s16816dgrad3d_optimized_f16/cutlass_tensorop_f16_s16816dgrad3d_optimized_f16_256x128_32x3_unity_stride.cu.o [ 78%] Built target cutlass_library_conv3d_sm80_bf16_s16816fprop3d_optimized_bf16_objs [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_f16_s16816fprop3d_optimized_f16_objs.dir/generated/conv3d/80/f16_s16816fprop3d_optimized_f16/all_sm80_f16_s16816fprop3d_optimized_f16_conv3d_operations.cu.o [ 78%] Built target cutlass_library_conv3d_sm80_bf16_s16816wgrad3d_optimized_bf16_objs [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_f16_s16816fprop3d_optimized_f16_objs.dir/generated/conv3d/80/f16_s16816fprop3d_optimized_f16/cutlass_tensorop_f16_s16816fprop3d_optimized_f16_256x128_32x3.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_f16_s16816wgrad3d_optimized_f16_objs.dir/generated/conv3d/80/f16_s16816wgrad3d_optimized_f16/all_sm80_f16_s16816wgrad3d_optimized_f16_conv3d_operations.cu.o [ 78%] Built target cutlass_library_conv3d_sm80_f16_s16816dgrad3d_optimized_f16_objs [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_h16816dgrad3d_analytic_objs.dir/generated/conv3d/80/h16816dgrad3d_analytic/all_sm80_h16816dgrad3d_analytic_conv3d_operations.cu.o [ 78%] Built target cutlass_library_conv3d_sm80_f16_s16816dgrad3d_analytic_f16_objs [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_f16_s16816wgrad3d_optimized_f16_objs.dir/generated/conv3d/80/f16_s16816wgrad3d_optimized_f16/cutlass_tensorop_f16_s16816wgrad3d_optimized_f16_256x128_32x3.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_h16816dgrad3d_optimized_objs.dir/generated/conv3d/80/h16816dgrad3d_optimized/all_sm80_h16816dgrad3d_optimized_conv3d_operations.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_h16816dgrad3d_analytic_objs.dir/generated/conv3d/80/h16816dgrad3d_analytic/cutlass_tensorop_h16816dgrad3d_analytic_256x128_32x3.cu.o [ 78%] Built target cutlass_library_conv3d_sm80_f16_s16816fprop3d_optimized_f16_objs [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_h16816fprop3d_optimized_objs.dir/generated/conv3d/80/h16816fprop3d_optimized/all_sm80_h16816fprop3d_optimized_conv3d_operations.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_h16816dgrad3d_optimized_objs.dir/generated/conv3d/80/h16816dgrad3d_optimized/cutlass_tensorop_h16816dgrad3d_optimized_256x128_32x3_unity_stride.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_h16816fprop3d_optimized_objs.dir/generated/conv3d/80/h16816fprop3d_optimized/cutlass_tensorop_h16816fprop3d_optimized_256x128_32x3.cu.o [ 78%] Built target cutlass_library_conv3d_sm80_f16_s16816wgrad3d_optimized_f16_objs [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_h16816wgrad3d_optimized_objs.dir/generated/conv3d/80/h16816wgrad3d_optimized/all_sm80_h16816wgrad3d_optimized_conv3d_operations.cu.o [ 78%] Built target cutlass_library_conv3d_sm80_h16816dgrad3d_analytic_objs [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816dgrad3d_analytic_bf16_objs.dir/generated/conv3d/80/s16816dgrad3d_analytic_bf16/all_sm80_s16816dgrad3d_analytic_bf16_conv3d_operations.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_h16816wgrad3d_optimized_objs.dir/generated/conv3d/80/h16816wgrad3d_optimized/cutlass_tensorop_h16816wgrad3d_optimized_256x128_32x3.cu.o [ 78%] Built target cutlass_library_conv3d_sm80_h16816dgrad3d_optimized_objs [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816dgrad3d_analytic_f16_objs.dir/generated/conv3d/80/s16816dgrad3d_analytic_f16/all_sm80_s16816dgrad3d_analytic_f16_conv3d_operations.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816dgrad3d_analytic_bf16_objs.dir/generated/conv3d/80/s16816dgrad3d_analytic_bf16/cutlass_tensorop_s16816dgrad3d_analytic_bf16_256x128_32x3.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816dgrad3d_analytic_f16_objs.dir/generated/conv3d/80/s16816dgrad3d_analytic_f16/cutlass_tensorop_s16816dgrad3d_analytic_f16_256x128_32x3.cu.o [ 78%] Built target cutlass_library_conv3d_sm80_h16816fprop3d_optimized_objs [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816dgrad3d_optimized_bf16_objs.dir/generated/conv3d/80/s16816dgrad3d_optimized_bf16/all_sm80_s16816dgrad3d_optimized_bf16_conv3d_operations.cu.o [ 78%] Built target cutlass_library_conv3d_sm80_h16816wgrad3d_optimized_objs [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816dgrad3d_optimized_f16_objs.dir/generated/conv3d/80/s16816dgrad3d_optimized_f16/all_sm80_s16816dgrad3d_optimized_f16_conv3d_operations.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816dgrad3d_optimized_bf16_objs.dir/generated/conv3d/80/s16816dgrad3d_optimized_bf16/cutlass_tensorop_s16816dgrad3d_optimized_bf16_256x128_32x3_unity_stride.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816dgrad3d_optimized_f16_objs.dir/generated/conv3d/80/s16816dgrad3d_optimized_f16/cutlass_tensorop_s16816dgrad3d_optimized_f16_256x128_32x3_unity_stride.cu.o [ 78%] Built target cutlass_library_conv3d_sm80_s16816dgrad3d_analytic_bf16_objs [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816fprop3d_optimized_bf16_objs.dir/generated/conv3d/80/s16816fprop3d_optimized_bf16/all_sm80_s16816fprop3d_optimized_bf16_conv3d_operations.cu.o [ 78%] Built target cutlass_library_conv3d_sm80_s16816dgrad3d_analytic_f16_objs [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816fprop3d_optimized_f16_objs.dir/generated/conv3d/80/s16816fprop3d_optimized_f16/all_sm80_s16816fprop3d_optimized_f16_conv3d_operations.cu.o [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816fprop3d_optimized_bf16_objs.dir/generated/conv3d/80/s16816fprop3d_optimized_bf16/cutlass_tensorop_s16816fprop3d_optimized_bf16_256x128_32x3.cu.o [ 78%] Built target cutlass_library_conv3d_sm80_s16816dgrad3d_optimized_bf16_objs [ 78%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816wgrad3d_optimized_bf16_objs.dir/generated/conv3d/80/s16816wgrad3d_optimized_bf16/all_sm80_s16816wgrad3d_optimized_bf16_conv3d_operations.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816fprop3d_optimized_f16_objs.dir/generated/conv3d/80/s16816fprop3d_optimized_f16/cutlass_tensorop_s16816fprop3d_optimized_f16_256x128_32x3.cu.o [ 79%] Built target cutlass_library_conv3d_sm80_s16816dgrad3d_optimized_f16_objs [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816wgrad3d_optimized_f16_objs.dir/generated/conv3d/80/s16816wgrad3d_optimized_f16/all_sm80_s16816wgrad3d_optimized_f16_conv3d_operations.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816wgrad3d_optimized_bf16_objs.dir/generated/conv3d/80/s16816wgrad3d_optimized_bf16/cutlass_tensorop_s16816wgrad3d_optimized_bf16_256x128_32x3.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm80_s16816wgrad3d_optimized_f16_objs.dir/generated/conv3d/80/s16816wgrad3d_optimized_f16/cutlass_tensorop_s16816wgrad3d_optimized_f16_256x128_32x3.cu.o [ 79%] Built target cutlass_library_conv3d_sm80_s16816fprop3d_optimized_bf16_objs [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm90_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32_objs.dir/generated/conv3d/90/f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32/all_sm90_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32_conv3d_operations.cu.o [ 79%] Built target cutlass_library_conv3d_sm80_s16816fprop3d_optimized_f16_objs [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm90_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32_objs.dir/generated/conv3d/90/f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32/all_sm90_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32_conv3d_operations.cu.o [ 79%] Built target cutlass_library_conv3d_sm80_s16816wgrad3d_optimized_bf16_objs [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm90_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32_objs.dir/generated/conv3d/90/f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32_64x64x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm90_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32_objs.dir/generated/conv3d/90/f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32_64x64x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm90_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32_objs.dir/generated/conv3d/90/f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32/all_sm90_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32_conv3d_operations.cu.o [ 79%] Built target cutlass_library_conv3d_sm80_s16816wgrad3d_optimized_f16_objs [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm90_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32_objs.dir/generated/conv3d/90/f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32_64x64x64_1x2x1_0_align16_warpspecialized_epi_tma.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm90_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32_objs.dir/generated/conv3d/90/f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32_64x64x32_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm90_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32_objs.dir/generated/conv3d/90/f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32_64x64x64_1x2x1_0_align16_warpspecialized_epi_tma.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm90_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32_objs.dir/generated/conv3d/90/f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32/cutlass3x_sm90_tensorop_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32_64x64x32_1x2x1_0_align16_warpspecialized_epi_tma.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm90_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32_objs.dir/generated/conv3d/90/s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32/all_sm90_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32_conv3d_operations.cu.o [ 79%] Built target cutlass_library_conv3d_sm90_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32_objs [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688herk_objs.dir/generated/rank_k/80/c1688herk/all_sm80_c1688herk_rank_k_operations.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm90_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32_objs.dir/generated/conv3d/90/s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32/cutlass3x_sm90_tensorop_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32_64x64x64_2x1x1_0_align16_warpspecialized_epi_tma.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688herk_objs.dir/generated/rank_k/80/c1688herk/cutlass_tensorop_c1688herk_128x64_16x4_n_l_align1.cu.o [ 79%] Built target cutlass_library_conv3d_sm90_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32_objs [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688herk_objs.dir/generated/rank_k/80/c1688herk/cutlass_tensorop_c1688herk_128x64_16x4_n_u_align1.cu.o [ 79%] Built target cutlass_library_conv3d_sm90_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32_objs [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_conv3d_sm90_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32_objs.dir/generated/conv3d/90/s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32/cutlass3x_sm90_tensorop_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32_64x64x64_1x2x1_0_align16_warpspecialized_epi_tma.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688syrk_objs.dir/generated/rank_k/80/c1688syrk/all_sm80_c1688syrk_rank_k_operations.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688syrk_objs.dir/generated/rank_k/80/c1688syrk/cutlass_tensorop_c1688syrk_128x64_16x4_n_l_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688syrk_objs.dir/generated/rank_k/80/c1688syrk/cutlass_tensorop_c1688syrk_128x64_16x4_n_u_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688herk_objs.dir/generated/rank_k/80/c1688herk/cutlass_tensorop_c1688herk_128x64_16x4_h_l_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688tf32herk_objs.dir/generated/rank_k/80/c1688tf32herk/all_sm80_c1688tf32herk_rank_k_operations.cu.o [ 79%] Built target cutlass_library_conv3d_sm90_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32_objs [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688herk_objs.dir/generated/rank_k/80/c1688herk/cutlass_tensorop_c1688herk_128x64_16x4_h_u_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688syrk_objs.dir/generated/rank_k/80/c1688syrk/cutlass_tensorop_c1688syrk_128x64_16x4_t_l_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688tf32herk_objs.dir/generated/rank_k/80/c1688tf32herk/cutlass_tensorop_c1688tf32herk_128x64_16x4_n_l_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688syrk_objs.dir/generated/rank_k/80/c1688syrk/cutlass_tensorop_c1688syrk_128x64_16x4_t_u_align1.cu.o [ 79%] Built target cutlass_library_rank_k_sm80_c1688herk_objs [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688tf32herk_objs.dir/generated/rank_k/80/c1688tf32herk/cutlass_tensorop_c1688tf32herk_128x64_16x4_n_u_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_d884syrk_objs.dir/generated/rank_k/80/d884syrk/all_sm80_d884syrk_rank_k_operations.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_gz884herk_objs.dir/generated/rank_k/80/gz884herk/all_sm80_gz884herk_rank_k_operations.cu.o [ 79%] Built target cutlass_library_rank_k_sm80_c1688syrk_objs [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688tf32herk_objs.dir/generated/rank_k/80/c1688tf32herk/cutlass_tensorop_c1688tf32herk_128x64_16x4_h_l_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_d884syrk_objs.dir/generated/rank_k/80/d884syrk/cutlass_tensorop_d884syrk_128x128_16x3_n_l_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_gz884herk_objs.dir/generated/rank_k/80/gz884herk/cutlass_tensorop_gz884herk_64x64_8x3_n_l_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_d884syrk_objs.dir/generated/rank_k/80/d884syrk/cutlass_tensorop_d884syrk_128x128_16x3_n_u_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_gz884herk_objs.dir/generated/rank_k/80/gz884herk/cutlass_tensorop_gz884herk_64x64_8x3_n_u_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_c1688tf32herk_objs.dir/generated/rank_k/80/c1688tf32herk/cutlass_tensorop_c1688tf32herk_128x64_16x4_h_u_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_gz884syrk_objs.dir/generated/rank_k/80/gz884syrk/all_sm80_gz884syrk_rank_k_operations.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_gz884syrk_objs.dir/generated/rank_k/80/gz884syrk/cutlass_tensorop_gz884syrk_64x64_8x3_n_l_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_gz884herk_objs.dir/generated/rank_k/80/gz884herk/cutlass_tensorop_gz884herk_64x64_8x3_h_l_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_d884syrk_objs.dir/generated/rank_k/80/d884syrk/cutlass_tensorop_d884syrk_128x128_16x3_t_l_align1.cu.o [ 79%] Built target cutlass_library_rank_k_sm80_c1688tf32herk_objs [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_gz884syrk_objs.dir/generated/rank_k/80/gz884syrk/cutlass_tensorop_gz884syrk_64x64_8x3_n_u_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_gz884herk_objs.dir/generated/rank_k/80/gz884herk/cutlass_tensorop_gz884herk_64x64_8x3_h_u_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_d884syrk_objs.dir/generated/rank_k/80/d884syrk/cutlass_tensorop_d884syrk_128x128_16x3_t_u_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_gz884syrk_objs.dir/generated/rank_k/80/gz884syrk/cutlass_tensorop_gz884syrk_64x64_8x3_t_l_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_gz884syrk_objs.dir/generated/rank_k/80/gz884syrk/cutlass_tensorop_gz884syrk_64x64_8x3_t_u_align1.cu.o [ 79%] Built target cutlass_library_rank_k_sm80_gz884herk_objs [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_s1688syrk_objs.dir/generated/rank_k/80/s1688syrk/all_sm80_s1688syrk_rank_k_operations.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_s1688tf32syrk_objs.dir/generated/rank_k/80/s1688tf32syrk/all_sm80_s1688tf32syrk_rank_k_operations.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_s1688syrk_objs.dir/generated/rank_k/80/s1688syrk/cutlass_tensorop_s1688syrk_256x128_16x3_n_l_align1.cu.o [ 79%] Built target cutlass_library_rank_k_sm80_d884syrk_objs [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_s1688syrk_objs.dir/generated/rank_k/80/s1688syrk/cutlass_tensorop_s1688syrk_256x128_16x3_n_u_align1.cu.o [ 79%] Built target cutlass_library_rank_k_sm80_gz884syrk_objs [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_s1688syrk_objs.dir/generated/rank_k/80/s1688syrk/cutlass_tensorop_s1688syrk_256x128_16x3_t_l_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_s1688tf32syrk_objs.dir/generated/rank_k/80/s1688tf32syrk/cutlass_tensorop_s1688tf32syrk_256x128_16x3_n_l_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_s1688tf32syrk_objs.dir/generated/rank_k/80/s1688tf32syrk/cutlass_tensorop_s1688tf32syrk_256x128_16x3_n_u_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_z884syrk_objs.dir/generated/rank_k/80/z884syrk/all_sm80_z884syrk_rank_k_operations.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_s1688syrk_objs.dir/generated/rank_k/80/s1688syrk/cutlass_tensorop_s1688syrk_256x128_16x3_t_u_align1.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_d1684syrk_objs.dir/generated/rank_k/90/d1684syrk/all_sm90_d1684syrk_rank_k_operations.cu.o [ 79%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_z884syrk_objs.dir/generated/rank_k/80/z884syrk/cutlass_tensorop_z884syrk_128x64_8x3_n_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_d1684syrk_objs.dir/generated/rank_k/90/d1684syrk/cutlass_tensorop_d1684syrk_128x128x16_1x1x1_3_n_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_z884syrk_objs.dir/generated/rank_k/80/z884syrk/cutlass_tensorop_z884syrk_128x64_8x3_n_u_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_s1688tf32syrk_objs.dir/generated/rank_k/80/s1688tf32syrk/cutlass_tensorop_s1688tf32syrk_256x128_16x3_t_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_d1684syrk_objs.dir/generated/rank_k/90/d1684syrk/cutlass_tensorop_d1684syrk_128x128x16_1x1x1_3_n_u_align1.cu.o [ 80%] Built target cutlass_library_rank_k_sm80_s1688syrk_objs [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_z884syrk_objs.dir/generated/rank_k/80/z884syrk/cutlass_tensorop_z884syrk_128x64_8x3_t_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_d1684syrk_objs.dir/generated/rank_k/90/d1684syrk/cutlass_tensorop_d1684syrk_128x128x16_1x1x1_3_t_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_z884syrk_objs.dir/generated/rank_k/80/z884syrk/cutlass_tensorop_z884syrk_128x64_8x3_t_u_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm80_s1688tf32syrk_objs.dir/generated/rank_k/80/s1688tf32syrk/cutlass_tensorop_s1688tf32syrk_256x128_16x3_t_u_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_d1684syrk_objs.dir/generated/rank_k/90/d1684syrk/cutlass_tensorop_d1684syrk_128x128x16_1x1x1_3_t_u_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_gz1684herk_objs.dir/generated/rank_k/90/gz1684herk/all_sm90_gz1684herk_rank_k_operations.cu.o [ 80%] Built target cutlass_library_rank_k_sm80_z884syrk_objs [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_gz1684syrk_objs.dir/generated/rank_k/90/gz1684syrk/all_sm90_gz1684syrk_rank_k_operations.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_gz1684herk_objs.dir/generated/rank_k/90/gz1684herk/cutlass_tensorop_gz1684herk_64x64x8_1x1x1_3_n_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_gz1684syrk_objs.dir/generated/rank_k/90/gz1684syrk/cutlass_tensorop_gz1684syrk_64x64x8_1x1x1_3_n_l_align1.cu.o [ 80%] Built target cutlass_library_rank_k_sm90_d1684syrk_objs [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_gz1684syrk_objs.dir/generated/rank_k/90/gz1684syrk/cutlass_tensorop_gz1684syrk_64x64x8_1x1x1_3_n_u_align1.cu.o [ 80%] Built target cutlass_library_rank_k_sm80_s1688tf32syrk_objs [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_gz1684syrk_objs.dir/generated/rank_k/90/gz1684syrk/cutlass_tensorop_gz1684syrk_64x64x8_1x1x1_3_t_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_gz1684herk_objs.dir/generated/rank_k/90/gz1684herk/cutlass_tensorop_gz1684herk_64x64x8_1x1x1_3_n_u_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_gz1684herk_objs.dir/generated/rank_k/90/gz1684herk/cutlass_tensorop_gz1684herk_64x64x8_1x1x1_3_h_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_z1684herk_objs.dir/generated/rank_k/90/z1684herk/all_sm90_z1684herk_rank_k_operations.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_gz1684syrk_objs.dir/generated/rank_k/90/gz1684syrk/cutlass_tensorop_gz1684syrk_64x64x8_1x1x1_3_t_u_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_z1684syrk_objs.dir/generated/rank_k/90/z1684syrk/all_sm90_z1684syrk_rank_k_operations.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_z1684herk_objs.dir/generated/rank_k/90/z1684herk/cutlass_tensorop_z1684herk_128x64x8_1x1x1_3_n_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_gz1684herk_objs.dir/generated/rank_k/90/gz1684herk/cutlass_tensorop_gz1684herk_64x64x8_1x1x1_3_h_u_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_z1684syrk_objs.dir/generated/rank_k/90/z1684syrk/cutlass_tensorop_z1684syrk_128x64x8_1x1x1_3_n_l_align1.cu.o [ 80%] Built target cutlass_library_rank_k_sm90_gz1684syrk_objs [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_z1684herk_objs.dir/generated/rank_k/90/z1684herk/cutlass_tensorop_z1684herk_128x64x8_1x1x1_3_n_u_align1.cu.o [ 80%] Built target cutlass_library_rank_k_sm90_gz1684herk_objs [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_z1684syrk_objs.dir/generated/rank_k/90/z1684syrk/cutlass_tensorop_z1684syrk_128x64x8_1x1x1_3_n_u_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_z1684syrk_objs.dir/generated/rank_k/90/z1684syrk/cutlass_tensorop_z1684syrk_128x64x8_1x1x1_3_t_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688her2k_objs.dir/generated/rank_2k/80/c1688her2k/all_sm80_c1688her2k_rank_2k_operations.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688her2k_objs.dir/generated/rank_2k/80/c1688her2k/cutlass_tensorop_c1688her2k_128x64_16x4_n_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_z1684herk_objs.dir/generated/rank_k/90/z1684herk/cutlass_tensorop_z1684herk_128x64x8_1x1x1_3_h_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688syr2k_objs.dir/generated/rank_2k/80/c1688syr2k/all_sm80_c1688syr2k_rank_2k_operations.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_z1684syrk_objs.dir/generated/rank_k/90/z1684syrk/cutlass_tensorop_z1684syrk_128x64x8_1x1x1_3_t_u_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688syr2k_objs.dir/generated/rank_2k/80/c1688syr2k/cutlass_tensorop_c1688syr2k_128x64_16x4_n_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_k_sm90_z1684herk_objs.dir/generated/rank_k/90/z1684herk/cutlass_tensorop_z1684herk_128x64x8_1x1x1_3_h_u_align1.cu.o [ 80%] Built target cutlass_library_rank_k_sm90_z1684syrk_objs [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688her2k_objs.dir/generated/rank_2k/80/c1688her2k/cutlass_tensorop_c1688her2k_128x64_16x4_n_u_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688syr2k_objs.dir/generated/rank_2k/80/c1688syr2k/cutlass_tensorop_c1688syr2k_128x64_16x4_n_u_align1.cu.o [ 80%] Built target cutlass_library_rank_k_sm90_z1684herk_objs [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688her2k_objs.dir/generated/rank_2k/80/c1688her2k/cutlass_tensorop_c1688her2k_128x64_16x4_h_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688her2k_objs.dir/generated/rank_2k/80/c1688her2k/cutlass_tensorop_c1688her2k_128x64_16x4_h_u_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688tf32her2k_objs.dir/generated/rank_2k/80/c1688tf32her2k/all_sm80_c1688tf32her2k_rank_2k_operations.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688syr2k_objs.dir/generated/rank_2k/80/c1688syr2k/cutlass_tensorop_c1688syr2k_128x64_16x4_t_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688tf32her2k_objs.dir/generated/rank_2k/80/c1688tf32her2k/cutlass_tensorop_c1688tf32her2k_128x64_16x4_n_l_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688tf32her2k_objs.dir/generated/rank_2k/80/c1688tf32her2k/cutlass_tensorop_c1688tf32her2k_128x64_16x4_n_u_align1.cu.o [ 80%] Built target cutlass_library_rank_2k_sm80_c1688her2k_objs [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688syr2k_objs.dir/generated/rank_2k/80/c1688syr2k/cutlass_tensorop_c1688syr2k_128x64_16x4_t_u_align1.cu.o [ 80%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688tf32her2k_objs.dir/generated/rank_2k/80/c1688tf32her2k/cutlass_tensorop_c1688tf32her2k_128x64_16x4_h_l_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688tf32syr2k_objs.dir/generated/rank_2k/80/c1688tf32syr2k/all_sm80_c1688tf32syr2k_rank_2k_operations.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688tf32syr2k_objs.dir/generated/rank_2k/80/c1688tf32syr2k/cutlass_tensorop_c1688tf32syr2k_128x64_16x4_n_l_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_d884syr2k_objs.dir/generated/rank_2k/80/d884syr2k/all_sm80_d884syr2k_rank_2k_operations.cu.o [ 81%] Built target cutlass_library_rank_2k_sm80_c1688syr2k_objs [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_d884syr2k_objs.dir/generated/rank_2k/80/d884syr2k/cutlass_tensorop_d884syr2k_128x128_16x3_n_l_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688tf32syr2k_objs.dir/generated/rank_2k/80/c1688tf32syr2k/cutlass_tensorop_c1688tf32syr2k_128x64_16x4_n_u_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688tf32her2k_objs.dir/generated/rank_2k/80/c1688tf32her2k/cutlass_tensorop_c1688tf32her2k_128x64_16x4_h_u_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_gz884her2k_objs.dir/generated/rank_2k/80/gz884her2k/all_sm80_gz884her2k_rank_2k_operations.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_d884syr2k_objs.dir/generated/rank_2k/80/d884syr2k/cutlass_tensorop_d884syr2k_128x128_16x3_n_u_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_gz884her2k_objs.dir/generated/rank_2k/80/gz884her2k/cutlass_tensorop_gz884her2k_64x64_8x3_n_l_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688tf32syr2k_objs.dir/generated/rank_2k/80/c1688tf32syr2k/cutlass_tensorop_c1688tf32syr2k_128x64_16x4_t_l_align1.cu.o [ 81%] Built target cutlass_library_rank_2k_sm80_c1688tf32her2k_objs [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_d884syr2k_objs.dir/generated/rank_2k/80/d884syr2k/cutlass_tensorop_d884syr2k_128x128_16x3_t_l_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_gz884her2k_objs.dir/generated/rank_2k/80/gz884her2k/cutlass_tensorop_gz884her2k_64x64_8x3_n_u_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_c1688tf32syr2k_objs.dir/generated/rank_2k/80/c1688tf32syr2k/cutlass_tensorop_c1688tf32syr2k_128x64_16x4_t_u_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_gz884syr2k_objs.dir/generated/rank_2k/80/gz884syr2k/all_sm80_gz884syr2k_rank_2k_operations.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_gz884syr2k_objs.dir/generated/rank_2k/80/gz884syr2k/cutlass_tensorop_gz884syr2k_64x64_8x3_n_l_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_gz884her2k_objs.dir/generated/rank_2k/80/gz884her2k/cutlass_tensorop_gz884her2k_64x64_8x3_h_l_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_d884syr2k_objs.dir/generated/rank_2k/80/d884syr2k/cutlass_tensorop_d884syr2k_128x128_16x3_t_u_align1.cu.o [ 81%] Built target cutlass_library_rank_2k_sm80_c1688tf32syr2k_objs [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_gz884syr2k_objs.dir/generated/rank_2k/80/gz884syr2k/cutlass_tensorop_gz884syr2k_64x64_8x3_n_u_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_gz884her2k_objs.dir/generated/rank_2k/80/gz884her2k/cutlass_tensorop_gz884her2k_64x64_8x3_h_u_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_s1688syr2k_objs.dir/generated/rank_2k/80/s1688syr2k/all_sm80_s1688syr2k_rank_2k_operations.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_gz884syr2k_objs.dir/generated/rank_2k/80/gz884syr2k/cutlass_tensorop_gz884syr2k_64x64_8x3_t_l_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_s1688syr2k_objs.dir/generated/rank_2k/80/s1688syr2k/cutlass_tensorop_s1688syr2k_256x128_16x3_n_l_align1.cu.o [ 81%] Built target cutlass_library_rank_2k_sm80_gz884her2k_objs [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_gz884syr2k_objs.dir/generated/rank_2k/80/gz884syr2k/cutlass_tensorop_gz884syr2k_64x64_8x3_t_u_align1.cu.o [ 81%] Built target cutlass_library_rank_2k_sm80_d884syr2k_objs [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_s1688syr2k_objs.dir/generated/rank_2k/80/s1688syr2k/cutlass_tensorop_s1688syr2k_256x128_16x3_n_u_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_s1688syr2k_objs.dir/generated/rank_2k/80/s1688syr2k/cutlass_tensorop_s1688syr2k_256x128_16x3_t_l_align1.cu.o [ 81%] Built target cutlass_library_rank_2k_sm80_gz884syr2k_objs [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_s1688syr2k_objs.dir/generated/rank_2k/80/s1688syr2k/cutlass_tensorop_s1688syr2k_256x128_16x3_t_u_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_s1688tf32syr2k_objs.dir/generated/rank_2k/80/s1688tf32syr2k/all_sm80_s1688tf32syr2k_rank_2k_operations.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_s1688tf32syr2k_objs.dir/generated/rank_2k/80/s1688tf32syr2k/cutlass_tensorop_s1688tf32syr2k_256x128_16x3_n_l_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_z884her2k_objs.dir/generated/rank_2k/80/z884her2k/all_sm80_z884her2k_rank_2k_operations.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_z884her2k_objs.dir/generated/rank_2k/80/z884her2k/cutlass_tensorop_z884her2k_128x64_8x3_n_l_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_z884her2k_objs.dir/generated/rank_2k/80/z884her2k/cutlass_tensorop_z884her2k_128x64_8x3_n_u_align1.cu.o [ 81%] Built target cutlass_library_rank_2k_sm80_s1688syr2k_objs [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_z884her2k_objs.dir/generated/rank_2k/80/z884her2k/cutlass_tensorop_z884her2k_128x64_8x3_h_l_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_s1688tf32syr2k_objs.dir/generated/rank_2k/80/s1688tf32syr2k/cutlass_tensorop_s1688tf32syr2k_256x128_16x3_n_u_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_s1688tf32syr2k_objs.dir/generated/rank_2k/80/s1688tf32syr2k/cutlass_tensorop_s1688tf32syr2k_256x128_16x3_t_l_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_z884syr2k_objs.dir/generated/rank_2k/80/z884syr2k/all_sm80_z884syr2k_rank_2k_operations.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_z884her2k_objs.dir/generated/rank_2k/80/z884her2k/cutlass_tensorop_z884her2k_128x64_8x3_h_u_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_z884syr2k_objs.dir/generated/rank_2k/80/z884syr2k/cutlass_tensorop_z884syr2k_128x64_8x3_n_l_align1.cu.o [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_z884syr2k_objs.dir/generated/rank_2k/80/z884syr2k/cutlass_tensorop_z884syr2k_128x64_8x3_n_u_align1.cu.o [ 81%] Built target cutlass_library_rank_2k_sm80_z884her2k_objs [ 81%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_z884syr2k_objs.dir/generated/rank_2k/80/z884syr2k/cutlass_tensorop_z884syr2k_128x64_8x3_t_l_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_d1684syr2k_objs.dir/generated/rank_2k/90/d1684syr2k/all_sm90_d1684syr2k_rank_2k_operations.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_s1688tf32syr2k_objs.dir/generated/rank_2k/80/s1688tf32syr2k/cutlass_tensorop_s1688tf32syr2k_256x128_16x3_t_u_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_d1684syr2k_objs.dir/generated/rank_2k/90/d1684syr2k/cutlass_tensorop_d1684syr2k_128x128x16_1x1x1_3_n_l_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_gz1684her2k_objs.dir/generated/rank_2k/90/gz1684her2k/all_sm90_gz1684her2k_rank_2k_operations.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm80_z884syr2k_objs.dir/generated/rank_2k/80/z884syr2k/cutlass_tensorop_z884syr2k_128x64_8x3_t_u_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_gz1684her2k_objs.dir/generated/rank_2k/90/gz1684her2k/cutlass_tensorop_gz1684her2k_64x64x8_1x1x1_3_n_l_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_d1684syr2k_objs.dir/generated/rank_2k/90/d1684syr2k/cutlass_tensorop_d1684syr2k_128x128x16_1x1x1_3_n_u_align1.cu.o [ 82%] Built target cutlass_library_rank_2k_sm80_z884syr2k_objs [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_gz1684her2k_objs.dir/generated/rank_2k/90/gz1684her2k/cutlass_tensorop_gz1684her2k_64x64x8_1x1x1_3_n_u_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_d1684syr2k_objs.dir/generated/rank_2k/90/d1684syr2k/cutlass_tensorop_d1684syr2k_128x128x16_1x1x1_3_t_l_align1.cu.o [ 82%] Built target cutlass_library_rank_2k_sm80_s1688tf32syr2k_objs [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_gz1684her2k_objs.dir/generated/rank_2k/90/gz1684her2k/cutlass_tensorop_gz1684her2k_64x64x8_1x1x1_3_h_l_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_d1684syr2k_objs.dir/generated/rank_2k/90/d1684syr2k/cutlass_tensorop_d1684syr2k_128x128x16_1x1x1_3_t_u_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_gz1684her2k_objs.dir/generated/rank_2k/90/gz1684her2k/cutlass_tensorop_gz1684her2k_64x64x8_1x1x1_3_h_u_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_gz1684syr2k_objs.dir/generated/rank_2k/90/gz1684syr2k/all_sm90_gz1684syr2k_rank_2k_operations.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_z1684her2k_objs.dir/generated/rank_2k/90/z1684her2k/all_sm90_z1684her2k_rank_2k_operations.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_gz1684syr2k_objs.dir/generated/rank_2k/90/gz1684syr2k/cutlass_tensorop_gz1684syr2k_64x64x8_1x1x1_3_n_l_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_z1684her2k_objs.dir/generated/rank_2k/90/z1684her2k/cutlass_tensorop_z1684her2k_128x64x8_1x1x1_3_n_l_align1.cu.o [ 82%] Built target cutlass_library_rank_2k_sm90_gz1684her2k_objs [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_gz1684syr2k_objs.dir/generated/rank_2k/90/gz1684syr2k/cutlass_tensorop_gz1684syr2k_64x64x8_1x1x1_3_n_u_align1.cu.o [ 82%] Built target cutlass_library_rank_2k_sm90_d1684syr2k_objs [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_z1684her2k_objs.dir/generated/rank_2k/90/z1684her2k/cutlass_tensorop_z1684her2k_128x64x8_1x1x1_3_n_u_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_z1684her2k_objs.dir/generated/rank_2k/90/z1684her2k/cutlass_tensorop_z1684her2k_128x64x8_1x1x1_3_h_l_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_gz1684syr2k_objs.dir/generated/rank_2k/90/gz1684syr2k/cutlass_tensorop_gz1684syr2k_64x64x8_1x1x1_3_t_l_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_z1684syr2k_objs.dir/generated/rank_2k/90/z1684syr2k/all_sm90_z1684syr2k_rank_2k_operations.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_z1684syr2k_objs.dir/generated/rank_2k/90/z1684syr2k/cutlass_tensorop_z1684syr2k_128x64x8_1x1x1_3_n_l_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_gz1684syr2k_objs.dir/generated/rank_2k/90/gz1684syr2k/cutlass_tensorop_gz1684syr2k_64x64x8_1x1x1_3_t_u_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/all_sm80_c1688tf32trmm_trmm_operations.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_z1684her2k_objs.dir/generated/rank_2k/90/z1684her2k/cutlass_tensorop_z1684her2k_128x64x8_1x1x1_3_h_u_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_nn_ls_l_nu_align1.cu.o [ 82%] Built target cutlass_library_rank_2k_sm90_gz1684syr2k_objs [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_z1684syr2k_objs.dir/generated/rank_2k/90/z1684syr2k/cutlass_tensorop_z1684syr2k_128x64x8_1x1x1_3_n_u_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_cn_ls_l_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/all_sm80_c1688trmm_trmm_operations.cu.o [ 82%] Built target cutlass_library_rank_2k_sm90_z1684her2k_objs [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_z1684syr2k_objs.dir/generated/rank_2k/90/z1684syr2k/cutlass_tensorop_z1684syr2k_128x64x8_1x1x1_3_t_l_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_nn_ls_l_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_nn_ls_l_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_cn_ls_l_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/all_sm80_d884trmm_trmm_operations.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_rank_2k_sm90_z1684syr2k_objs.dir/generated/rank_2k/90/z1684syr2k/cutlass_tensorop_z1684syr2k_128x64x8_1x1x1_3_t_u_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_nn_ls_u_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_cn_ls_l_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_nn_ls_l_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_cn_ls_u_nu_align1.cu.o [ 82%] Built target cutlass_library_rank_2k_sm90_z1684syr2k_objs [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_nn_ls_l_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_nn_ls_l_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_nn_ls_u_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/all_sm80_gz884trmm_trmm_operations.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_cn_ls_l_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_nn_ls_u_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_cn_ls_u_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_nn_ls_l_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_nn_ls_u_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_nn_ls_u_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_nn_rs_l_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_cn_ls_l_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_nn_ls_l_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_nn_rs_l_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_cn_rs_l_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_cn_ls_u_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_cn_ls_l_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_nn_rs_l_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_nn_rs_l_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_nn_ls_u_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_nn_ls_u_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_nn_rs_u_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_cn_rs_l_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_cn_ls_u_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_cn_ls_u_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_nn_rs_u_un_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_nn_rs_u_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_nn_rs_l_nu_align1.cu.o [ 82%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_nn_ls_u_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_tn_ls_l_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_cn_rs_u_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_cn_ls_u_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_cn_rs_l_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_tn_ls_l_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_nn_rs_u_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_nn_rs_l_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_nn_rs_l_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_tn_ls_u_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_cn_rs_u_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_cn_rs_l_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_cn_rs_l_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_tn_ls_u_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_nn_rs_l_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_tn_ls_l_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_nn_rs_u_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_cn_rs_l_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_tn_rs_l_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_hn_ls_l_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_nn_rs_u_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_tn_rs_l_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_cn_rs_u_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_tn_ls_l_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_cn_rs_u_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_tn_rs_u_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_hn_ls_l_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_nn_rs_u_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_nn_rs_u_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_d884trmm_objs.dir/generated/trmm/80/d884trmm/cutlass_tensorop_d884trmm_128x128_16x3_tn_rs_u_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_tn_ls_u_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_cn_rs_u_un_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_cn_rs_u_un_align1.cu.o [ 83%] Built target cutlass_library_trmm_sm80_d884trmm_objs [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_hn_ls_u_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_tn_ls_l_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_tn_ls_l_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_hn_ls_l_nu_align1.cu.o [ 83%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/all_sm80_s1688tf32trmm_trmm_operations.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_tn_ls_u_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_tn_ls_l_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_nn_ls_l_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_hn_ls_l_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_hn_ls_u_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_hn_ls_l_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_nn_ls_l_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_tn_ls_l_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_tn_ls_u_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_tn_rs_l_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_hn_ls_l_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_nn_ls_u_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_hn_ls_u_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_hn_rs_l_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_tn_ls_u_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_tn_ls_u_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_nn_ls_u_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_tn_rs_l_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_hn_ls_u_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_hn_ls_u_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_nn_rs_l_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_hn_rs_l_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_tn_rs_l_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_nn_rs_l_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_tn_ls_u_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_tn_rs_u_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_hn_rs_l_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_hn_rs_u_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_hn_ls_u_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_nn_rs_u_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_tn_rs_l_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_tn_rs_u_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_hn_rs_l_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_tn_rs_l_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_nn_rs_u_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_tn_rs_u_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688tf32trmm_objs.dir/generated/trmm/80/c1688tf32trmm/cutlass_tensorop_c1688tf32trmm_128x64_16x4_hn_rs_u_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_hn_rs_l_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_tn_ls_l_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_hn_rs_u_nu_align1.cu.o [ 84%] Built target cutlass_library_trmm_sm80_c1688tf32trmm_objs [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_tn_ls_l_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_tn_rs_l_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_tn_rs_u_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/all_sm80_s1688trmm_trmm_operations.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_tn_ls_u_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_nn_ls_l_nu_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_gz884trmm_objs.dir/generated/trmm/80/gz884trmm/cutlass_tensorop_gz884trmm_64x64_8x3_hn_rs_u_un_align1.cu.o [ 84%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_hn_rs_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_tn_ls_u_un_align1.cu.o [ 85%] Built target cutlass_library_trmm_sm80_gz884trmm_objs [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_nn_ls_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_tn_rs_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_tn_rs_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/all_sm80_z884trmm_trmm_operations.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_hn_rs_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_nn_ls_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_tn_rs_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_nn_ls_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_nn_ls_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_tn_rs_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_cn_ls_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_tn_rs_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_nn_ls_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_c1688trmm_objs.dir/generated/trmm/80/c1688trmm/cutlass_tensorop_c1688trmm_128x64_16x4_hn_rs_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_nn_rs_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688tf32trmm_objs.dir/generated/trmm/80/s1688tf32trmm/cutlass_tensorop_s1688tf32trmm_256x128_16x3_tn_rs_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_cn_ls_l_un_align1.cu.o [ 85%] Built target cutlass_library_trmm_sm80_c1688trmm_objs [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_nn_rs_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_nn_ls_u_nu_align1.cu.o [ 85%] Built target cutlass_library_trmm_sm80_s1688tf32trmm_objs [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_cn_ls_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_nn_rs_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_nn_rs_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/all_sm90_d1684trmm_trmm_operations.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_nn_ls_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_nn_ls_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/all_sm90_gz1684trmm_trmm_operations.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_tn_ls_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_cn_ls_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_nn_ls_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_nn_ls_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_cn_ls_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_nn_rs_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_tn_ls_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_nn_ls_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_nn_ls_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_cn_rs_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_tn_ls_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_nn_ls_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_cn_ls_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_nn_rs_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_nn_rs_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_tn_ls_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_nn_ls_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_cn_rs_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_nn_rs_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_cn_ls_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_tn_rs_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_nn_rs_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_nn_rs_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_nn_ls_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_cn_rs_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_tn_rs_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_cn_ls_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_nn_rs_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_nn_rs_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_tn_rs_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_nn_rs_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_tn_ls_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_cn_rs_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_cn_rs_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_s1688trmm_objs.dir/generated/trmm/80/s1688trmm/cutlass_tensorop_s1688trmm_256x128_16x3_tn_rs_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_tn_ls_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_tn_ls_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_nn_rs_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_tn_ls_u_nu_align1.cu.o [ 85%] Built target cutlass_library_trmm_sm80_s1688trmm_objs [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_tn_ls_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_hn_ls_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_cn_rs_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_nn_rs_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_tn_ls_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_tn_rs_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_hn_ls_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_cn_rs_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/all_sm90_z1684trmm_trmm_operations.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_tn_rs_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_tn_ls_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_nn_ls_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_nn_rs_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_tn_rs_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_hn_ls_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_cn_ls_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_cn_rs_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_d1684trmm_objs.dir/generated/trmm/90/d1684trmm/cutlass_tensorop_d1684trmm_128x128x16_1x1x1_3_tn_rs_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_tn_ls_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_nn_ls_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_tn_ls_l_nu_align1.cu.o [ 85%] Built target cutlass_library_trmm_sm90_d1684trmm_objs [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_hn_ls_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_hn_ls_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_cn_ls_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_nn_ls_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_tn_ls_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_tn_rs_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_hn_rs_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_cn_ls_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_hn_ls_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688hemm_objs.dir/generated/symm/80/c1688hemm/all_sm80_c1688hemm_symm_operations.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_tn_rs_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_nn_ls_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688hemm_objs.dir/generated/symm/80/c1688hemm/cutlass_tensorop_c1688hemm_128x64_16x4_n_ls_l_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_tn_ls_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_hn_rs_l_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_cn_ls_u_un_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_hn_ls_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688hemm_objs.dir/generated/symm/80/c1688hemm/cutlass_tensorop_c1688hemm_128x64_16x4_n_ls_u_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_tn_rs_u_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_nn_rs_l_nu_align1.cu.o [ 85%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_tn_ls_u_un_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_hn_rs_u_nu_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_cn_rs_l_nu_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_hn_ls_u_un_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688hemm_objs.dir/generated/symm/80/c1688hemm/cutlass_tensorop_c1688hemm_128x64_16x4_n_rs_l_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_tn_rs_u_un_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_nn_rs_l_un_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_tn_rs_l_nu_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm80_z884trmm_objs.dir/generated/trmm/80/z884trmm/cutlass_tensorop_z884trmm_128x64_8x3_hn_rs_u_un_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_cn_rs_l_un_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_hn_rs_l_nu_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688hemm_objs.dir/generated/symm/80/c1688hemm/cutlass_tensorop_c1688hemm_128x64_16x4_n_rs_u_align1.cu.o [ 86%] Built target cutlass_library_trmm_sm80_z884trmm_objs [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_nn_rs_u_nu_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_tn_rs_l_un_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688symm_objs.dir/generated/symm/80/c1688symm/all_sm80_c1688symm_symm_operations.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688symm_objs.dir/generated/symm/80/c1688symm/cutlass_tensorop_c1688symm_128x64_16x4_n_ls_l_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_hn_rs_l_un_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_cn_rs_u_nu_align1.cu.o [ 86%] Built target cutlass_library_symm_sm80_c1688hemm_objs [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688symm_objs.dir/generated/symm/80/c1688symm/cutlass_tensorop_c1688symm_128x64_16x4_n_ls_u_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_tn_rs_u_nu_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_nn_rs_u_un_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688tf32hemm_objs.dir/generated/symm/80/c1688tf32hemm/all_sm80_c1688tf32hemm_symm_operations.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_hn_rs_u_nu_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688symm_objs.dir/generated/symm/80/c1688symm/cutlass_tensorop_c1688symm_128x64_16x4_n_rs_l_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688tf32hemm_objs.dir/generated/symm/80/c1688tf32hemm/cutlass_tensorop_c1688tf32hemm_128x64_16x4_n_ls_l_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_cn_rs_u_un_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_tn_rs_u_un_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_tn_ls_l_nu_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688tf32hemm_objs.dir/generated/symm/80/c1688tf32hemm/cutlass_tensorop_c1688tf32hemm_128x64_16x4_n_ls_u_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688symm_objs.dir/generated/symm/80/c1688symm/cutlass_tensorop_c1688symm_128x64_16x4_n_rs_u_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_gz1684trmm_objs.dir/generated/trmm/90/gz1684trmm/cutlass_tensorop_gz1684trmm_64x64x8_1x1x1_3_hn_rs_u_un_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_hn_ls_l_nu_align1.cu.o [ 86%] Built target cutlass_library_trmm_sm90_gz1684trmm_objs [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688tf32hemm_objs.dir/generated/symm/80/c1688tf32hemm/cutlass_tensorop_c1688tf32hemm_128x64_16x4_n_rs_l_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_tn_ls_l_un_align1.cu.o [ 86%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688tf32symm_objs.dir/generated/symm/80/c1688tf32symm/all_sm80_c1688tf32symm_symm_operations.cu.o [ 86%] Built target cutlass_library_symm_sm80_c1688symm_objs [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688tf32symm_objs.dir/generated/symm/80/c1688tf32symm/cutlass_tensorop_c1688tf32symm_128x64_16x4_n_ls_l_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_hn_ls_l_un_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688tf32hemm_objs.dir/generated/symm/80/c1688tf32hemm/cutlass_tensorop_c1688tf32hemm_128x64_16x4_n_rs_u_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_d884symm_objs.dir/generated/symm/80/d884symm/all_sm80_d884symm_symm_operations.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_tn_ls_u_nu_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_d884symm_objs.dir/generated/symm/80/d884symm/cutlass_tensorop_d884symm_128x128_16x3_n_ls_l_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688tf32symm_objs.dir/generated/symm/80/c1688tf32symm/cutlass_tensorop_c1688tf32symm_128x64_16x4_n_ls_u_align1.cu.o [ 87%] Built target cutlass_library_symm_sm80_c1688tf32hemm_objs [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_hn_ls_u_nu_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_d884symm_objs.dir/generated/symm/80/d884symm/cutlass_tensorop_d884symm_128x128_16x3_n_ls_u_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688tf32symm_objs.dir/generated/symm/80/c1688tf32symm/cutlass_tensorop_c1688tf32symm_128x64_16x4_n_rs_l_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_gz884hemm_objs.dir/generated/symm/80/gz884hemm/all_sm80_gz884hemm_symm_operations.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_tn_ls_u_un_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_gz884hemm_objs.dir/generated/symm/80/gz884hemm/cutlass_tensorop_gz884hemm_64x64_8x3_n_ls_l_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_d884symm_objs.dir/generated/symm/80/d884symm/cutlass_tensorop_d884symm_128x128_16x3_n_rs_l_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_hn_ls_u_un_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_c1688tf32symm_objs.dir/generated/symm/80/c1688tf32symm/cutlass_tensorop_c1688tf32symm_128x64_16x4_n_rs_u_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_gz884hemm_objs.dir/generated/symm/80/gz884hemm/cutlass_tensorop_gz884hemm_64x64_8x3_n_ls_u_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_d884symm_objs.dir/generated/symm/80/d884symm/cutlass_tensorop_d884symm_128x128_16x3_n_rs_u_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_tn_rs_l_nu_align1.cu.o [ 87%] Built target cutlass_library_symm_sm80_c1688tf32symm_objs [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_gz884hemm_objs.dir/generated/symm/80/gz884hemm/cutlass_tensorop_gz884hemm_64x64_8x3_n_rs_l_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_hn_rs_l_nu_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_gz884symm_objs.dir/generated/symm/80/gz884symm/all_sm80_gz884symm_symm_operations.cu.o [ 87%] Built target cutlass_library_symm_sm80_d884symm_objs [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_gz884hemm_objs.dir/generated/symm/80/gz884hemm/cutlass_tensorop_gz884hemm_64x64_8x3_n_rs_u_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_gz884symm_objs.dir/generated/symm/80/gz884symm/cutlass_tensorop_gz884symm_64x64_8x3_n_ls_l_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_tn_rs_l_un_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_hn_rs_l_un_align1.cu.o [ 87%] Built target cutlass_library_symm_sm80_gz884hemm_objs [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_tn_rs_u_nu_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_gz884symm_objs.dir/generated/symm/80/gz884symm/cutlass_tensorop_gz884symm_64x64_8x3_n_ls_u_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_gz884symm_objs.dir/generated/symm/80/gz884symm/cutlass_tensorop_gz884symm_64x64_8x3_n_rs_l_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_s1688symm_objs.dir/generated/symm/80/s1688symm/all_sm80_s1688symm_symm_operations.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_s1688symm_objs.dir/generated/symm/80/s1688symm/cutlass_tensorop_s1688symm_256x128_16x3_n_ls_l_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_hn_rs_u_nu_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_s1688symm_objs.dir/generated/symm/80/s1688symm/cutlass_tensorop_s1688symm_256x128_16x3_n_ls_u_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_gz884symm_objs.dir/generated/symm/80/gz884symm/cutlass_tensorop_gz884symm_64x64_8x3_n_rs_u_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_tn_rs_u_un_align1.cu.o [ 87%] Built target cutlass_library_symm_sm80_gz884symm_objs [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_s1688symm_objs.dir/generated/symm/80/s1688symm/cutlass_tensorop_s1688symm_256x128_16x3_n_rs_l_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_trmm_sm90_z1684trmm_objs.dir/generated/trmm/90/z1684trmm/cutlass_tensorop_z1684trmm_128x64x8_1x1x1_3_hn_rs_u_un_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_s1688tf32symm_objs.dir/generated/symm/80/s1688tf32symm/all_sm80_s1688tf32symm_symm_operations.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_z884hemm_objs.dir/generated/symm/80/z884hemm/all_sm80_z884hemm_symm_operations.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_s1688tf32symm_objs.dir/generated/symm/80/s1688tf32symm/cutlass_tensorop_s1688tf32symm_256x128_16x3_n_ls_l_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_z884hemm_objs.dir/generated/symm/80/z884hemm/cutlass_tensorop_z884hemm_128x64_8x3_n_ls_l_align1.cu.o [ 87%] Built target cutlass_library_trmm_sm90_z1684trmm_objs [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_s1688tf32symm_objs.dir/generated/symm/80/s1688tf32symm/cutlass_tensorop_s1688tf32symm_256x128_16x3_n_ls_u_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_s1688symm_objs.dir/generated/symm/80/s1688symm/cutlass_tensorop_s1688symm_256x128_16x3_n_rs_u_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_z884hemm_objs.dir/generated/symm/80/z884hemm/cutlass_tensorop_z884hemm_128x64_8x3_n_ls_u_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_z884symm_objs.dir/generated/symm/80/z884symm/all_sm80_z884symm_symm_operations.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_s1688tf32symm_objs.dir/generated/symm/80/s1688tf32symm/cutlass_tensorop_s1688tf32symm_256x128_16x3_n_rs_l_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_z884symm_objs.dir/generated/symm/80/z884symm/cutlass_tensorop_z884symm_128x64_8x3_n_ls_l_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_z884hemm_objs.dir/generated/symm/80/z884hemm/cutlass_tensorop_z884hemm_128x64_8x3_n_rs_l_align1.cu.o [ 87%] Built target cutlass_library_symm_sm80_s1688symm_objs [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_z884symm_objs.dir/generated/symm/80/z884symm/cutlass_tensorop_z884symm_128x64_8x3_n_ls_u_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_s1688tf32symm_objs.dir/generated/symm/80/s1688tf32symm/cutlass_tensorop_s1688tf32symm_256x128_16x3_n_rs_u_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_z884hemm_objs.dir/generated/symm/80/z884hemm/cutlass_tensorop_z884hemm_128x64_8x3_n_rs_u_align1.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_d1684symm_objs.dir/generated/symm/90/d1684symm/all_sm90_d1684symm_symm_operations.cu.o [ 87%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_z884symm_objs.dir/generated/symm/80/z884symm/cutlass_tensorop_z884symm_128x64_8x3_n_rs_l_align1.cu.o [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_d1684symm_objs.dir/generated/symm/90/d1684symm/cutlass_tensorop_d1684symm_128x128x16_1x1x1_3_n_ls_l_align1.cu.o [ 88%] Built target cutlass_library_symm_sm80_z884hemm_objs [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_d1684symm_objs.dir/generated/symm/90/d1684symm/cutlass_tensorop_d1684symm_128x128x16_1x1x1_3_n_ls_u_align1.cu.o [ 88%] Built target cutlass_library_symm_sm80_s1688tf32symm_objs [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_d1684symm_objs.dir/generated/symm/90/d1684symm/cutlass_tensorop_d1684symm_128x128x16_1x1x1_3_n_rs_l_align1.cu.o [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm80_z884symm_objs.dir/generated/symm/80/z884symm/cutlass_tensorop_z884symm_128x64_8x3_n_rs_u_align1.cu.o [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_gz1684hemm_objs.dir/generated/symm/90/gz1684hemm/all_sm90_gz1684hemm_symm_operations.cu.o [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_gz1684hemm_objs.dir/generated/symm/90/gz1684hemm/cutlass_tensorop_gz1684hemm_64x64x8_1x1x1_3_n_ls_l_align1.cu.o [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_gz1684symm_objs.dir/generated/symm/90/gz1684symm/all_sm90_gz1684symm_symm_operations.cu.o [ 88%] Built target cutlass_library_symm_sm80_z884symm_objs [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_gz1684hemm_objs.dir/generated/symm/90/gz1684hemm/cutlass_tensorop_gz1684hemm_64x64x8_1x1x1_3_n_ls_u_align1.cu.o [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_d1684symm_objs.dir/generated/symm/90/d1684symm/cutlass_tensorop_d1684symm_128x128x16_1x1x1_3_n_rs_u_align1.cu.o [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_gz1684symm_objs.dir/generated/symm/90/gz1684symm/cutlass_tensorop_gz1684symm_64x64x8_1x1x1_3_n_ls_l_align1.cu.o [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_z1684hemm_objs.dir/generated/symm/90/z1684hemm/all_sm90_z1684hemm_symm_operations.cu.o [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_gz1684hemm_objs.dir/generated/symm/90/gz1684hemm/cutlass_tensorop_gz1684hemm_64x64x8_1x1x1_3_n_rs_l_align1.cu.o [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_gz1684symm_objs.dir/generated/symm/90/gz1684symm/cutlass_tensorop_gz1684symm_64x64x8_1x1x1_3_n_ls_u_align1.cu.o [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_z1684hemm_objs.dir/generated/symm/90/z1684hemm/cutlass_tensorop_z1684hemm_128x64x8_1x1x1_3_n_ls_l_align1.cu.o [ 88%] Built target cutlass_library_symm_sm90_d1684symm_objs [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_gz1684hemm_objs.dir/generated/symm/90/gz1684hemm/cutlass_tensorop_gz1684hemm_64x64x8_1x1x1_3_n_rs_u_align1.cu.o [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_gz1684symm_objs.dir/generated/symm/90/gz1684symm/cutlass_tensorop_gz1684symm_64x64x8_1x1x1_3_n_rs_l_align1.cu.o [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_z1684hemm_objs.dir/generated/symm/90/z1684hemm/cutlass_tensorop_z1684hemm_128x64x8_1x1x1_3_n_ls_u_align1.cu.o [ 88%] Built target cutlass_library_symm_sm90_gz1684hemm_objs [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_z1684hemm_objs.dir/generated/symm/90/z1684hemm/cutlass_tensorop_z1684hemm_128x64x8_1x1x1_3_n_rs_l_align1.cu.o [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_gz1684symm_objs.dir/generated/symm/90/gz1684symm/cutlass_tensorop_gz1684symm_64x64x8_1x1x1_3_n_rs_u_align1.cu.o [ 88%] Building CUDA object tools/library/CMakeFiles/cutlass_library_symm_sm90_z1684hemm_objs.dir/generated/symm/90/z1684hemm/cutlass_tensorop_z1684hemm_128x64x8_1x1x1_3_n_rs_u_align1.cu.o [ 88%] Linking CUDA static library libcutlass_symm_sm90_z1684symm.a [ 88%] Built target cutlass_library_symm_sm90_z1684symm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm50_cgemm.a [ 88%] Built target cutlass_library_gemm_sm50_cgemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm50_dgemm.a [ 88%] Built target cutlass_library_gemm_sm50_dgemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm50_sgemm.a [ 88%] Built target cutlass_library_gemm_sm50_sgemm_static [ 88%] Built target cutlass_library_symm_sm90_gz1684symm_objs [ 88%] Linking CUDA static library libcutlass_gemm_sm60_hgemm.a [ 88%] Linking CUDA static library libcutlass_gemm_sm61_igemm_s8.a [ 88%] Built target cutlass_library_gemm_sm61_igemm_s8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm61_s8_igemm_s8.a [ 88%] Built target cutlass_library_gemm_sm60_hgemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm70_f16_s884gemm_f16.a [ 88%] Built target cutlass_library_gemm_sm61_s8_igemm_s8_static [ 88%] Built target cutlass_library_gemm_sm70_f16_s884gemm_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm70_f16_s884gemm_planar_complex_array_f16.a [ 88%] Linking CUDA static library libcutlass_gemm_sm70_f16_s884gemm_planar_complex_f16.a [ 88%] Built target cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm70_h884gemm.a [ 88%] Built target cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm70_h884gemm_planar_complex.a [ 88%] Built target cutlass_library_gemm_sm70_h884gemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm70_h884gemm_planar_complex_array.a [ 88%] Built target cutlass_library_gemm_sm70_h884gemm_planar_complex_static [ 88%] Linking CUDA static library libcutlass_gemm_sm70_s884gemm_f16.a [ 88%] Built target cutlass_library_gemm_sm70_h884gemm_planar_complex_array_static [ 88%] Linking CUDA static library libcutlass_gemm_sm70_s884gemm_planar_complex_array_f16.a [ 88%] Linking CUDA static library libcutlass_gemm_sm70_s884gemm_planar_complex_f16.a [ 88%] Built target cutlass_library_gemm_sm70_s884gemm_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm75_f16_s1688gemm_f16.a [ 88%] Built target cutlass_library_gemm_sm75_f16_s1688gemm_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm75_f16_s1688gemm_planar_complex_array_f16.a [ 88%] Built target cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm75_f16_s1688gemm_planar_complex_f16.a [ 88%] Built target cutlass_library_gemm_sm70_s884gemm_planar_complex_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm75_h1688gemm.a [ 88%] Built target cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16_static [ 88%] Built target cutlass_library_gemm_sm75_h1688gemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm75_h1688gemm_planar_complex.a [ 88%] Linking CUDA static library libcutlass_gemm_sm75_h1688gemm_planar_complex_array.a [ 88%] Built target cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm75_i88128xorgemm_b1.a [ 88%] Built target cutlass_library_gemm_sm75_i88128xorgemm_b1_static [ 88%] Linking CUDA static library libcutlass_gemm_sm75_i8816gemm_s8.a [ 88%] Built target cutlass_library_gemm_sm75_i8816gemm_s8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm75_i8816gemm_u8.a [ 88%] Built target cutlass_library_gemm_sm75_h1688gemm_planar_complex_array_static [ 88%] Built target cutlass_library_gemm_sm75_h1688gemm_planar_complex_static [ 88%] Linking CUDA static library libcutlass_gemm_sm75_i8832gemm_s4.a [ 88%] Linking CUDA static library libcutlass_gemm_sm75_i8832gemm_u4.a [ 88%] Built target cutlass_library_gemm_sm75_i8816gemm_u8_static [ 88%] Built target cutlass_library_gemm_sm75_i8832gemm_s4_static [ 88%] Linking CUDA static library libcutlass_gemm_sm75_s1688gemm_f16.a [ 88%] Linking CUDA static library libcutlass_gemm_sm75_s1688gemm_planar_complex_array_f16.a [ 88%] Built target cutlass_library_gemm_sm75_i8832gemm_u4_static [ 88%] Linking CUDA static library libcutlass_gemm_sm75_s1688gemm_planar_complex_f16.a [ 88%] Built target cutlass_library_gemm_sm75_s1688gemm_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm75_s4_i8832gemm_s4.a [ 88%] Built target cutlass_library_gemm_sm75_s4_i8832gemm_s4_static [ 88%] Linking CUDA static library libcutlass_gemm_sm75_s8_i8816gemm_s8.a [ 88%] Built target cutlass_library_gemm_sm75_s8_i8816gemm_s8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm75_u4_i8832gemm_u4.a [ 88%] Built target cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm75_u8_i8816gemm_u8.a [ 88%] Built target cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_bf16_s16816gemm_bf16.a [ 88%] Built target cutlass_library_gemm_sm75_u4_i8832gemm_u4_static [ 88%] Built target cutlass_library_gemm_sm75_u8_i8816gemm_u8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_bf16_s16816gemm_bf16_s8.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_bf16_s16816gemm_bf16_u8.a [ 88%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_s8_static [ 88%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_u8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_bf16_s16816gemm_planar_complex_bf16.a [ 88%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_bf16_s16816gemm_s8_bf16.a [ 88%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_s8_bf16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_bf16_s16816gemm_u8_bf16.a [ 88%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_u8_bf16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_bf16_s16832spgemm_bf16.a [ 88%] Built target cutlass_library_gemm_sm80_bf16_s16832spgemm_bf16_static [ 88%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_c1688gemm.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_c1688tf32gemm.a [ 88%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_cgemm.a [ 88%] Built target cutlass_library_gemm_sm80_c1688gemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_d884gemm.a [ 88%] Built target cutlass_library_gemm_sm80_c1688tf32gemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_dgemm.a [ 88%] Built target cutlass_library_gemm_sm80_d884gemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_f16_s16816gemm_f16.a [ 88%] Built target cutlass_library_gemm_sm80_dgemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_f16_s16816gemm_f16_s8.a [ 88%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_f16_s16816gemm_f16_u8.a [ 88%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_f16_s8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_f16_s16816gemm_planar_complex_array_f16.a [ 88%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_f16_u8_static [ 88%] Built target cutlass_library_gemm_sm80_cgemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_f16_s16816gemm_planar_complex_f16.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_f16_s16816gemm_s8_f16.a [ 88%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_s8_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_f16_s16816gemm_u8_f16.a [ 88%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_u8_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_f16_s16832spgemm_f16.a [ 88%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_gz884gemm.a [ 88%] Built target cutlass_library_gemm_sm80_f16_s16832spgemm_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_h16816gemm.a [ 88%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_h16816gemm_f16_s8.a [ 88%] Built target cutlass_library_gemm_sm80_h16816gemm_f16_s8_static [ 88%] Built target cutlass_library_gemm_sm80_h16816gemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_h16816gemm_f16_u8.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_h16816gemm_grouped.a [ 88%] Built target cutlass_library_gemm_sm80_h16816gemm_f16_u8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_h16816gemm_planar_complex.a [ 88%] Built target cutlass_library_gemm_sm80_gz884gemm_static [ 88%] Built target cutlass_library_gemm_sm80_h16816gemm_grouped_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_h16816gemm_s8_f16.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_h16816gemm_planar_complex_array.a [ 88%] Built target cutlass_library_gemm_sm80_h16816gemm_s8_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_h16816gemm_u8_f16.a [ 88%] Built target cutlass_library_gemm_sm80_h16816gemm_u8_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_h16832spgemm.a [ 88%] Built target cutlass_library_gemm_sm80_h16832spgemm_static [ 88%] Built target cutlass_library_gemm_sm80_h16816gemm_planar_complex_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_i168128spgemm_s4.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_i168256andgemm_b1.a [ 88%] Built target cutlass_library_gemm_sm80_h16816gemm_planar_complex_array_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_i168256xorgemm_b1.a [ 88%] Built target cutlass_library_gemm_sm80_i168256andgemm_b1_static [ 88%] Built target cutlass_library_gemm_sm80_i168128spgemm_s4_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_i16832gemm_s4_s8.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_i16832gemm_s8.a [ 88%] Built target cutlass_library_gemm_sm80_i168256xorgemm_b1_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_i16832gemm_s8_s4.a [ 88%] Built target cutlass_library_gemm_sm80_i16832gemm_s8_static [ 88%] Built target cutlass_library_gemm_sm80_i16832gemm_s4_s8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_i16832gemm_u8.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_i16864gemm_s4.a [ 88%] Built target cutlass_library_gemm_sm80_i16832gemm_s8_s4_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_i16864gemm_u4.a [ 88%] Built target cutlass_library_gemm_sm80_i16832gemm_u8_static [ 88%] Built target cutlass_library_gemm_sm80_i16864gemm_s4_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_i16864spgemm_s8.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_bf16.a [ 88%] Built target cutlass_library_gemm_sm80_i16864gemm_u4_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_bf16_s8.a [ 88%] Built target cutlass_library_gemm_sm80_i16864spgemm_s8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_bf16_u8.a [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_bf16_s8_static [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_bf16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_f16.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_f16_s8.a [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_bf16_u8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_f16_u8.a [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_f16_s8_static [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_f16_u8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_grouped_bf16.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_grouped_f16.a [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_planar_complex_array_bf16.a [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_grouped_bf16_static [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_grouped_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_planar_complex_array_f16.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_planar_complex_bf16.a [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_planar_complex_f16.a [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_s8_bf16.a [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_s8_f16.a [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_s8_bf16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_u8_bf16.a [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_s8_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816gemm_u8_f16.a [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_u8_bf16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16816tf32spgemm.a [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_u8_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16832spgemm_bf16.a [ 88%] Built target cutlass_library_gemm_sm80_s16816tf32spgemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s16832spgemm_f16.a [ 88%] Built target cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s1688bf16gemm.a [ 88%] Built target cutlass_library_gemm_sm80_s16832spgemm_bf16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s1688f16gemm.a [ 88%] Built target cutlass_library_gemm_sm80_s16832spgemm_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s1688gemm.a [ 88%] Built target cutlass_library_gemm_sm80_s1688bf16gemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s1688gemm_tf32.a [ 88%] Built target cutlass_library_gemm_sm80_s1688f16gemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s1688tf32gemm.a [ 88%] Built target cutlass_library_gemm_sm80_s1688gemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s4_i168128spgemm_s4.a [ 88%] Built target cutlass_library_gemm_sm80_s1688gemm_tf32_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s4_i16864gemm_s4.a [ 88%] Built target cutlass_library_gemm_sm80_s4_i168128spgemm_s4_static [ 88%] Built target cutlass_library_gemm_sm80_s1688tf32gemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s8_i16832gemm_s4_s8.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s8_i16832gemm_s8.a [ 88%] Built target cutlass_library_gemm_sm80_s4_i16864gemm_s4_static [ 88%] Built target cutlass_library_gemm_sm80_s8_i16832gemm_s4_s8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s8_i16864spgemm_s8.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_s8_i16832gemm_s8_s4.a [ 88%] Built target cutlass_library_gemm_sm80_s8_i16832gemm_s8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_sgemm.a [ 88%] Built target cutlass_library_gemm_sm80_s8_i16864spgemm_s8_static [ 88%] Built target cutlass_library_gemm_sm80_s8_i16832gemm_s8_s4_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_tf32_s1688gemm_tf32.a [ 88%] Linking CUDA static library libcutlass_gemm_sm80_u4_i16864gemm_u4.a [ 88%] Built target cutlass_library_gemm_sm80_u4_i16864gemm_u4_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_u8_i16832gemm_u8.a [ 88%] Built target cutlass_library_gemm_sm80_sgemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm80_z884gemm.a [ 88%] Built target cutlass_library_gemm_sm80_tf32_s1688gemm_tf32_static [ 88%] Built target cutlass_library_gemm_sm80_u8_i16832gemm_u8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm89_s16864fastaccumspgemm_e4m3.a [ 88%] Linking CUDA static library libcutlass_gemm_sm89_s16864fastaccumspgemm_e4m3_e5m2.a [ 88%] Built target cutlass_library_gemm_sm89_s16864fastaccumspgemm_e4m3_static [ 88%] Built target cutlass_library_gemm_sm89_s16864fastaccumspgemm_e4m3_e5m2_static [ 88%] Linking CUDA static library libcutlass_gemm_sm89_s16864fastaccumspgemm_e5m2.a [ 88%] Linking CUDA static library libcutlass_gemm_sm89_s16864fastaccumspgemm_e5m2_e4m3.a [ 88%] Built target cutlass_library_gemm_sm89_s16864fastaccumspgemm_e5m2_static [ 88%] Built target cutlass_library_gemm_sm89_s16864fastaccumspgemm_e5m2_e4m3_static [ 88%] Linking CUDA static library libcutlass_gemm_sm89_s16864spgemm_e4m3.a [ 88%] Linking CUDA static library libcutlass_gemm_sm89_s16864spgemm_e4m3_e5m2.a [ 88%] Built target cutlass_library_gemm_sm89_s16864spgemm_e4m3_static [ 88%] Built target cutlass_library_gemm_sm89_s16864spgemm_e4m3_e5m2_static [ 88%] Linking CUDA static library libcutlass_gemm_sm89_s16864spgemm_e5m2.a [ 88%] Linking CUDA static library libcutlass_gemm_sm89_s16864spgemm_e5m2_e4m3.a [ 88%] Built target cutlass_library_gemm_sm89_s16864spgemm_e5m2_e4m3_static [ 88%] Built target cutlass_library_gemm_sm89_s16864spgemm_e5m2_static [ 88%] Built target cutlass_library_gemm_sm80_z884gemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_bf16_s64x128x32gemm_e4m3.a [ 88%] Linking CUDA static library libcutlass_gemm_sm90_bf16_s64x128x16gemm_bf16.a [ 88%] Linking CUDA static library libcutlass_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2.a [ 88%] Built target cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_static [ 88%] Built target cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_bf16_s64x128x32gemm_e5m2.a [ 88%] Linking CUDA static library libcutlass_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3.a [ 88%] Built target cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_bf16_s64x128x32spgemm_bf16.a [ 88%] Built target cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e4m3.a [ 88%] Built target cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2.a [ 88%] Built target cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e5m2.a [ 88%] Built target cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3.a [ 88%] Built target cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_d1684gemm.a [ 88%] Built target cutlass_library_gemm_sm90_d1684gemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_f16_s64x128x16gemm_f16.a [ 88%] Built target cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_f16_s64x128x32gemm_e4m3.a [ 88%] Built target cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2.a [ 88%] Built target cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_f16_s64x128x32gemm_e5m2.a [ 88%] Built target cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3.a [ 88%] Built target cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_f16_s64x128x32spgemm_f16.a [ 88%] Built target cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_f16_s64x128x64spgemm_e4m3.a [ 88%] Built target cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2.a [ 88%] Built target cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_f16_s64x128x64spgemm_e5m2.a [ 88%] Built target cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3.a [ 88%] Built target cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_gz1684gemm.a [ 88%] Built target cutlass_library_gemm_sm90_gz1684gemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_h64x128x16gemm.a [ 88%] Built target cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_h64x128x32spgemm.a [ 88%] Built target cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_i64x128x32gemm_s8.a [ 88%] Built target cutlass_library_gemm_sm90_i64x128x32gemm_s8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_i64x128x32gemm_u8.a [ 88%] Built target cutlass_library_gemm_sm90_h64x128x16gemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_i64x128x64spgemm_s8.a [ 88%] Built target cutlass_library_gemm_sm90_i64x128x32gemm_u8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_i64x128x64spgemm_u8.a [ 88%] Built target cutlass_library_gemm_sm90_i64x128x64spgemm_s8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x16gemm_bf16.a [ 88%] Built target cutlass_library_gemm_sm90_i64x128x64spgemm_u8_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x16gemm_f16.a [ 88%] Built target cutlass_library_gemm_sm90_h64x128x32spgemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x16spgemm_tf32.a [ 88%] Built target cutlass_library_gemm_sm90_s64x128x16gemm_bf16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x16tf32spgemm.a [ 88%] Built target cutlass_library_gemm_sm90_s64x128x16gemm_f16_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x32gemm_e4m3.a [ 88%] Built target cutlass_library_gemm_sm90_s64x128x16spgemm_tf32_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x32gemm_e4m3_e5m2.a [ 88%] Built target cutlass_library_gemm_sm90_s64x128x16tf32spgemm_static [ 88%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x32gemm_e5m2.a [ 88%] Built target cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x32gemm_e5m2_e4m3.a [ 89%] Built target cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x32spgemm_bf16.a [ 89%] Built target cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x32spgemm_f16.a [ 89%] Built target cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x64spgemm_e4m3.a [ 89%] Built target cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x64spgemm_e4m3_e5m2.a [ 89%] Built target cutlass_library_gemm_sm90_s64x128x32spgemm_bf16_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x64spgemm_e5m2.a [ 89%] Built target cutlass_library_gemm_sm90_s64x128x32spgemm_f16_static [ 89%] Built target cutlass_library_symm_sm90_z1684hemm_objs [ 89%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x64spgemm_e5m2_e4m3.a [ 89%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x8gemm_tf32.a [ 89%] Built target cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2_static [ 89%] Built target cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_s64x128x8tf32gemm.a [ 89%] Linking CUDA static library libcutlass_gemm_sm90_s8_i64x128x32gemm_s8.a [ 89%] Built target cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8_static [ 89%] Built target cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_s8_i64x128x32gemm_u8.a [ 89%] Linking CUDA static library libcutlass_gemm_sm90_s8_i64x128x64spgemm_s8.a [ 89%] Built target cutlass_library_gemm_sm90_s64x128x8gemm_tf32_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_s8_i64x128x64spgemm_u8.a [ 89%] Built target cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_h64x128x16gemm.a [ 89%] Built target cutlass_library_gemm_sm90_void_h64x128x16gemm_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_h64x128x32spgemm.a [ 89%] Built target cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_i64x128x32gemm_s8.a [ 89%] Built target cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8_static [ 89%] Built target cutlass_library_gemm_sm90_s64x128x8tf32gemm_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_i64x128x32gemm_u8.a [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_i64x128x64spgemm_s8.a [ 89%] Built target cutlass_library_gemm_sm90_void_i64x128x32gemm_s8_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_i64x128x64spgemm_u8.a [ 89%] Built target cutlass_library_gemm_sm90_void_i64x128x32gemm_u8_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_s64x128x16gemm_bf16.a [ 89%] Built target cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8_static [ 89%] Built target cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_s64x128x16gemm_f16.a [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_s64x128x32gemm_e4m3.a [ 89%] Built target cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_s64x128x32gemm_e4m3_e5m2.a [ 89%] Built target cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3_e5m2_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_s64x128x32gemm_e5m2.a [ 89%] Built target cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16_static [ 89%] Built target cutlass_library_gemm_sm90_void_h64x128x32spgemm_static [ 89%] Built target cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_s64x128x32gemm_e5m2_e4m3.a [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_s64x128x32spgemm_bf16.a [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_s64x128x32spgemm_f16.a [ 89%] Built target cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2_e4m3_static [ 89%] Linking CUDA static library libcutlass_rank_k_sm80_z884herk.a [ 89%] Built target cutlass_library_gemm_sm90_void_s64x128x16gemm_f16_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_s64x128x64spgemm_e4m3.a [ 89%] Built target cutlass_library_rank_k_sm80_z884herk_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_s64x128x64spgemm_e4m3_e5m2.a [ 89%] Built target cutlass_library_gemm_sm90_void_s64x128x64spgemm_e4m3_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_s64x128x64spgemm_e5m2.a [ 89%] Built target cutlass_library_gemm_sm90_void_s64x128x64spgemm_e4m3_e5m2_static [ 89%] Linking CUDA static library libcutlass_gemm_sm90_void_s64x128x64spgemm_e5m2_e4m3.a [ 89%] Built target cutlass_library_gemm_sm90_void_s64x128x64spgemm_e5m2_static [ 89%] Built target cutlass_library_gemm_sm90_void_s64x128x64spgemm_e5m2_e4m3_static [ 90%] Linking CUDA static library libcutlass_gemm_sm90_z1684gemm.a [ 90%] Linking CUDA static library libcutlass_conv2d_sm50_cf32_cdgrad_optimized_cf32.a [ 90%] Built target cutlass_library_conv2d_sm50_cf32_cdgrad_optimized_cf32_static [ 90%] Linking CUDA static library libcutlass_conv2d_sm50_cf32_cfprop_optimized_cf32.a [ 90%] Built target cutlass_library_gemm_sm90_z1684gemm_static [ 90%] Linking CUDA static library libcutlass_conv2d_sm50_cf32_cwgrad_optimized_cf32.a [ 90%] Built target cutlass_library_conv2d_sm50_cf32_cfprop_optimized_cf32_static [ 90%] Linking CUDA static library libcutlass_conv2d_sm50_sdgrad_optimized.a [ 90%] Built target cutlass_library_conv2d_sm50_cf32_cwgrad_optimized_cf32_static [ 90%] Linking CUDA static library libcutlass_conv2d_sm50_sfprop_optimized.a [ 90%] Built target cutlass_library_conv2d_sm50_sdgrad_optimized_static [ 90%] Linking CUDA static library libcutlass_conv2d_sm50_swgrad_optimized.a [ 90%] Built target cutlass_library_conv2d_sm50_sfprop_optimized_static [ 90%] Linking CUDA static library libcutlass_conv2d_sm60_hfprop_optimized.a [ 90%] Built target cutlass_library_conv2d_sm50_swgrad_optimized_static [ 90%] Built target cutlass_library_conv2d_sm60_hfprop_optimized_static [ 90%] Linking CUDA static library libcutlass_conv2d_sm70_f16_s884dgrad_optimized_f16.a [ 90%] Linking CUDA static library libcutlass_conv2d_sm70_f16_s884fprop_optimized_f16.a [ 90%] Built target cutlass_library_conv2d_sm70_f16_s884dgrad_optimized_f16_static [ 90%] Built target cutlass_library_conv2d_sm70_f16_s884fprop_optimized_f16_static [ 90%] Linking CUDA static library libcutlass_conv2d_sm70_f16_s884wgrad_optimized_f16.a [ 90%] Linking CUDA static library libcutlass_conv2d_sm70_h884dgrad_optimized.a [ 90%] Built target cutlass_library_conv2d_sm70_f16_s884wgrad_optimized_f16_static [ 90%] Built target cutlass_library_conv2d_sm70_h884dgrad_optimized_static [ 90%] Linking CUDA static library libcutlass_conv2d_sm70_h884fprop_optimized.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm70_h884wgrad_optimized.a [ 91%] Built target cutlass_library_conv2d_sm70_h884fprop_optimized_static [ 91%] Built target cutlass_library_conv2d_sm70_h884wgrad_optimized_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm70_s884dgrad_optimized_f16.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm70_s884fprop_optimized_f16.a [ 91%] Built target cutlass_library_conv2d_sm70_s884fprop_optimized_f16_static [ 91%] Built target cutlass_library_conv2d_sm70_s884dgrad_optimized_f16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm70_s884wgrad_optimized_f16.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_cf32_cdgrad_optimized_cf32.a [ 91%] Built target cutlass_library_conv2d_sm70_s884wgrad_optimized_f16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_cf32_cfprop_optimized_cf32.a [ 91%] Built target cutlass_library_conv2d_sm75_cf32_cdgrad_optimized_cf32_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_cf32_cwgrad_optimized_cf32.a [ 91%] Built target cutlass_library_conv2d_sm75_cf32_cfprop_optimized_cf32_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_f16_s1688dgrad_optimized_f16.a [ 91%] Built target cutlass_library_conv2d_sm75_cf32_cwgrad_optimized_cf32_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_f16_s1688fprop_few_channels_f16.a [ 91%] Built target cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16_static [ 91%] Built target cutlass_library_conv2d_sm75_f16_s1688dgrad_optimized_f16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_f16_s1688fprop_fixed_channels_f16.a [ 91%] Built target cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_f16_s1688fprop_optimized_f16.a [ 91%] Built target cutlass_library_conv2d_sm75_f16_s1688fprop_few_channels_f16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_f16_s1688wgrad_optimized_f16.a [ 91%] Built target cutlass_library_conv2d_sm75_f16_s1688fprop_fixed_channels_f16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_h1688dgrad_optimized.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_h1688fprop_few_channels.a [ 91%] Built target cutlass_library_conv2d_sm75_f16_s1688fprop_optimized_f16_static [ 91%] Built target cutlass_library_conv2d_sm75_f16_s1688wgrad_optimized_f16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_h1688fprop_fixed_channels.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_h1688fprop_optimized.a [ 91%] Built target cutlass_library_conv2d_sm75_h1688fprop_few_channels_static [ 91%] Built target cutlass_library_conv2d_sm75_h1688dgrad_optimized_static [ 91%] Built target cutlass_library_conv2d_sm75_h1688fprop_fixed_channels_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_h1688wgrad_optimized.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_i8816fprop_optimized_s8.a [ 91%] Built target cutlass_library_conv2d_sm75_h1688fprop_optimized_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_i8816fprop_optimized_u8.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_i8832fprop_optimized_s4.a [ 91%] Built target cutlass_library_conv2d_sm75_h1688wgrad_optimized_static [ 91%] Built target cutlass_library_conv2d_sm75_i8816fprop_optimized_s8_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_i8832fprop_optimized_u4.a [ 91%] Built target cutlass_library_conv2d_sm75_i8816fprop_optimized_u8_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_s1688dgrad_optimized_f16.a [ 91%] Built target cutlass_library_conv2d_sm75_i8832fprop_optimized_s4_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_s1688fprop_few_channels_f16.a [ 91%] Built target cutlass_library_conv2d_sm75_i8832fprop_optimized_u4_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_s1688fprop_fixed_channels_f16.a [ 91%] Built target cutlass_library_conv2d_sm75_s1688dgrad_optimized_f16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_s1688fprop_optimized_f16.a [ 91%] Built target cutlass_library_conv2d_sm75_s1688fprop_few_channels_f16_static [ 91%] Built target cutlass_library_conv2d_sm75_s1688fprop_fixed_channels_f16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_s1688wgrad_optimized_f16.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_s4_i8832fprop_optimized_s4.a [ 91%] Built target cutlass_library_conv2d_sm75_s1688fprop_optimized_f16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_s8_i8816fprop_few_channels_s8.a [ 91%] Built target cutlass_library_conv2d_sm75_s4_i8832fprop_optimized_s4_static [ 91%] Built target cutlass_library_conv2d_sm75_s1688wgrad_optimized_f16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_s8_i8816fprop_fixed_channels_s8.a [ 91%] Built target cutlass_library_conv2d_sm75_s8_i8816fprop_few_channels_s8_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_s8_i8816fprop_optimized_s8.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_u4_i8832fprop_optimized_u4.a [ 91%] Built target cutlass_library_conv2d_sm75_s8_i8816fprop_fixed_channels_s8_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_u8_i8816fprop_few_channels_u8.a [ 91%] Built target cutlass_library_conv2d_sm75_s8_i8816fprop_optimized_s8_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_u8_i8816fprop_fixed_channels_u8.a [ 91%] Built target cutlass_library_conv2d_sm75_u4_i8832fprop_optimized_u4_static [ 91%] Built target cutlass_library_conv2d_sm75_u8_i8816fprop_few_channels_u8_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm75_u8_i8816fprop_optimized_u8.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_bf16_s16816dgrad_optimized_bf16.a [ 91%] Built target cutlass_library_conv2d_sm75_u8_i8816fprop_fixed_channels_u8_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_bf16_s16816fprop_fixed_channels_bf16.a [ 91%] Built target cutlass_library_conv2d_sm75_u8_i8816fprop_optimized_u8_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_bf16_s16816fprop_optimized_bf16.a [ 91%] Built target cutlass_library_conv2d_sm80_bf16_s16816dgrad_optimized_bf16_static [ 91%] Built target cutlass_library_conv2d_sm80_bf16_s16816fprop_fixed_channels_bf16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_bf16_s16816wgrad_optimized_bf16.a [ 91%] Built target cutlass_library_conv2d_sm80_bf16_s16816fprop_optimized_bf16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_f16_s16816dgrad_optimized_f16.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_f16_s16816fprop_fixed_channels_f16.a [ 91%] Built target cutlass_library_conv2d_sm80_bf16_s16816wgrad_optimized_bf16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_f16_s16816fprop_optimized_f16.a [ 91%] Built target cutlass_library_conv2d_sm80_f16_s16816dgrad_optimized_f16_static [ 91%] Built target cutlass_library_conv2d_sm80_f16_s16816fprop_fixed_channels_f16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_f16_s16816wgrad_optimized_f16.a [ 91%] Built target cutlass_library_conv2d_sm80_f16_s16816fprop_optimized_f16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_h16816dgrad_optimized.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_h16816fprop_fixed_channels.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_h16816fprop_optimized.a [ 91%] Built target cutlass_library_conv2d_sm80_f16_s16816wgrad_optimized_f16_static [ 91%] Built target cutlass_library_conv2d_sm80_h16816dgrad_optimized_static [ 91%] Built target cutlass_library_conv2d_sm80_h16816fprop_fixed_channels_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_h16816wgrad_optimized.a [ 91%] Built target cutlass_library_conv2d_sm80_h16816fprop_optimized_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_i16832fprop_optimized_s8.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_i16832fprop_optimized_u8.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_i16864fprop_optimized_s4.a [ 91%] Built target cutlass_library_conv2d_sm80_h16816wgrad_optimized_static [ 91%] Built target cutlass_library_conv2d_sm80_i16832fprop_optimized_s8_static [ 91%] Built target cutlass_library_conv2d_sm80_i16832fprop_optimized_u8_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_i16864fprop_optimized_u4.a [ 91%] Built target cutlass_library_conv2d_sm80_i16864fprop_optimized_s4_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s16816dgrad_optimized_bf16.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s16816dgrad_optimized_f16.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s16816fprop_fixed_channels_bf16.a [ 91%] Built target cutlass_library_conv2d_sm80_i16864fprop_optimized_u4_static [ 91%] Built target cutlass_library_conv2d_sm80_s16816dgrad_optimized_bf16_static [ 91%] Built target cutlass_library_conv2d_sm80_s16816dgrad_optimized_f16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s16816fprop_fixed_channels_f16.a [ 91%] Built target cutlass_library_conv2d_sm80_s16816fprop_fixed_channels_bf16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s16816fprop_optimized_bf16.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s16816fprop_optimized_f16.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s16816wgrad_optimized_bf16.a [ 91%] Built target cutlass_library_conv2d_sm80_s16816fprop_fixed_channels_f16_static [ 91%] Built target cutlass_library_conv2d_sm80_s16816fprop_optimized_bf16_static [ 91%] Built target cutlass_library_conv2d_sm80_s16816fprop_optimized_f16_static [ 91%] Built target cutlass_library_conv2d_sm80_s16816wgrad_optimized_bf16_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s16816wgrad_optimized_f16.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s1688bf16dgrad_optimized.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s1688bf16fprop_optimized.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s1688bf16wgrad_optimized.a [ 91%] Built target cutlass_library_conv2d_sm80_s16816wgrad_optimized_f16_static [ 91%] Built target cutlass_library_conv2d_sm80_s1688bf16dgrad_optimized_static [ 91%] Built target cutlass_library_conv2d_sm80_s1688bf16fprop_optimized_static [ 91%] Built target cutlass_library_conv2d_sm80_s1688bf16wgrad_optimized_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s1688dgrad_optimized.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s1688dgrad_optimized_tf32.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s1688f16dgrad_optimized.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s1688f16fprop_optimized.a [ 91%] Built target cutlass_library_conv2d_sm80_s1688dgrad_optimized_static [ 91%] Built target cutlass_library_conv2d_sm80_s1688dgrad_optimized_tf32_static [ 91%] Built target cutlass_library_conv2d_sm80_s1688f16fprop_optimized_static [ 91%] Built target cutlass_library_conv2d_sm80_s1688f16dgrad_optimized_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s1688f16wgrad_optimized.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s1688fprop_optimized.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s1688fprop_optimized_tf32.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s1688tf32dgrad_optimized.a [ 91%] Built target cutlass_library_conv2d_sm80_s1688f16wgrad_optimized_static [ 91%] Built target cutlass_library_conv2d_sm80_s1688fprop_optimized_static [ 91%] Built target cutlass_library_conv2d_sm80_s1688tf32dgrad_optimized_static [ 91%] Built target cutlass_library_conv2d_sm80_s1688fprop_optimized_tf32_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s1688tf32fprop_optimized.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s1688tf32wgrad_optimized.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s1688wgrad_optimized_tf32.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s1688wgrad_optimized.a [ 91%] Built target cutlass_library_conv2d_sm80_s1688tf32fprop_optimized_static [ 91%] Built target cutlass_library_conv2d_sm80_s1688tf32wgrad_optimized_static [ 91%] Built target cutlass_library_conv2d_sm80_s1688wgrad_optimized_tf32_static [ 91%] Built target cutlass_library_conv2d_sm80_s1688wgrad_optimized_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s4_i16864fprop_optimized_s4.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s8_i16832fprop_few_channels_s8.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s8_i16832fprop_fixed_channels_s8.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_s8_i16832fprop_optimized_s8.a [ 91%] Built target cutlass_library_conv2d_sm80_s4_i16864fprop_optimized_s4_static [ 91%] Built target cutlass_library_conv2d_sm80_s8_i16832fprop_few_channels_s8_static [ 91%] Built target cutlass_library_conv2d_sm80_s8_i16832fprop_fixed_channels_s8_static [ 91%] Built target cutlass_library_conv2d_sm80_s8_i16832fprop_optimized_s8_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_sdgrad_optimized.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_sfprop_optimized.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_swgrad_optimized.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_tf32_s1688dgrad_optimized_tf32.a [ 91%] Built target cutlass_library_conv2d_sm80_sdgrad_optimized_static [ 91%] Built target cutlass_library_conv2d_sm80_sfprop_optimized_static [ 91%] Built target cutlass_library_conv2d_sm80_tf32_s1688dgrad_optimized_tf32_static [ 91%] Built target cutlass_library_conv2d_sm80_swgrad_optimized_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_tf32_s1688fprop_optimized_tf32.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_tf32_s1688wgrad_optimized_tf32.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_u4_i16864fprop_optimized_u4.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_u8_i16832fprop_few_channels_u8.a [ 91%] Built target cutlass_library_conv2d_sm80_tf32_s1688fprop_optimized_tf32_static [ 91%] Built target cutlass_library_conv2d_sm80_tf32_s1688wgrad_optimized_tf32_static [ 91%] Built target cutlass_library_conv2d_sm80_u4_i16864fprop_optimized_u4_static [ 91%] Built target cutlass_library_conv2d_sm80_u8_i16832fprop_few_channels_u8_static [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_u8_i16832fprop_optimized_u8.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm80_u8_i16832fprop_fixed_channels_u8.a [ 91%] Linking CUDA static library libcutlass_conv2d_sm90_f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.a [ 92%] Linking CUDA static library libcutlass_conv2d_sm90_f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 92%] Built target cutlass_library_conv2d_sm80_u8_i16832fprop_optimized_u8_static [ 92%] Built target cutlass_library_conv2d_sm80_u8_i16832fprop_fixed_channels_u8_static [ 92%] Built target cutlass_library_conv2d_sm90_f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_static [ 92%] Built target cutlass_library_conv2d_sm90_f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 92%] Linking CUDA static library libcutlass_conv2d_sm90_f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.a [ 92%] Linking CUDA static library libcutlass_conv2d_sm90_f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 92%] Linking CUDA static library libcutlass_conv2d_sm90_f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 92%] Linking CUDA static library libcutlass_conv2d_sm90_f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 92%] Built target cutlass_library_conv2d_sm90_f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_static [ 92%] Built target cutlass_library_conv2d_sm90_f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 92%] Built target cutlass_library_conv2d_sm90_f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 92%] Built target cutlass_library_conv2d_sm90_f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 92%] Linking CUDA static library libcutlass_conv2d_sm90_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.a [ 92%] Linking CUDA static library libcutlass_conv2d_sm90_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 92%] Linking CUDA static library libcutlass_conv2d_sm90_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.a [ 92%] Linking CUDA static library libcutlass_conv2d_sm90_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 92%] Built target cutlass_library_conv2d_sm90_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_static [ 92%] Built target cutlass_library_conv2d_sm90_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 92%] Built target cutlass_library_conv2d_sm90_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_static [ 92%] Built target cutlass_library_conv2d_sm90_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 92%] Linking CUDA static library libcutlass_conv2d_sm90_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.a [ 92%] Linking CUDA static library libcutlass_conv2d_sm90_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 92%] Linking CUDA static library libcutlass_conv2d_sm90_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Built target cutlass_library_conv2d_sm90_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Built target cutlass_library_conv2d_sm90_f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Built target cutlass_library_conv2d_sm90_f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Built target cutlass_library_conv2d_sm90_f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Built target cutlass_library_conv2d_sm90_f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.a [ 93%] Built target cutlass_library_conv2d_sm90_f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16_static [ 93%] Built target cutlass_library_conv2d_sm90_f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_static [ 93%] Built target cutlass_library_conv2d_sm90_f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32_static [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32.a [ 93%] Built target cutlass_library_conv2d_sm90_f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32_static [ 93%] Built target cutlass_library_conv2d_sm90_f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_static [ 93%] Built target cutlass_library_conv2d_sm90_f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32_static [ 93%] Built target cutlass_library_conv2d_sm90_f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32_static [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.a [ 93%] Built target cutlass_library_conv2d_sm90_f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32_static [ 93%] Built target cutlass_library_conv2d_sm90_f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_static [ 93%] Built target cutlass_library_conv2d_sm90_f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32_static [ 93%] Built target cutlass_library_conv2d_sm90_f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32_static [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32.a [ 93%] Built target cutlass_library_conv2d_sm90_f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32_static [ 93%] Built target cutlass_library_conv2d_sm90_f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32_static [ 93%] Built target cutlass_library_conv2d_sm90_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32_static [ 93%] Built target cutlass_library_conv2d_sm90_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32_static [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32.a [ 93%] Built target cutlass_library_conv2d_sm90_s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32_static [ 93%] Built target cutlass_library_conv2d_sm90_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32_static [ 93%] Built target cutlass_library_conv2d_sm90_s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32_static [ 93%] Built target cutlass_library_conv2d_sm90_s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32_static [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32.a [ 93%] Linking CUDA static library libcutlass_conv2d_sm90_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_bf16_s16816dgrad3d_analytic_bf16.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_bf16_s16816dgrad3d_optimized_bf16.a [ 93%] Built target cutlass_library_conv2d_sm90_s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32_static [ 93%] Built target cutlass_library_conv2d_sm90_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32_static [ 93%] Built target cutlass_library_conv3d_sm80_bf16_s16816dgrad3d_analytic_bf16_static [ 93%] Built target cutlass_library_conv3d_sm80_bf16_s16816dgrad3d_optimized_bf16_static [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_bf16_s16816fprop3d_optimized_bf16.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_bf16_s16816wgrad3d_optimized_bf16.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_f16_s16816dgrad3d_analytic_f16.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_f16_s16816dgrad3d_optimized_f16.a [ 93%] Built target cutlass_library_conv3d_sm80_bf16_s16816fprop3d_optimized_bf16_static [ 93%] Built target cutlass_library_conv3d_sm80_bf16_s16816wgrad3d_optimized_bf16_static [ 93%] Built target cutlass_library_conv3d_sm80_f16_s16816dgrad3d_optimized_f16_static [ 93%] Built target cutlass_library_conv3d_sm80_f16_s16816dgrad3d_analytic_f16_static [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_f16_s16816fprop3d_optimized_f16.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_f16_s16816wgrad3d_optimized_f16.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_h16816dgrad3d_optimized.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_h16816dgrad3d_analytic.a [ 93%] Built target cutlass_library_conv3d_sm80_f16_s16816fprop3d_optimized_f16_static [ 93%] Built target cutlass_library_conv3d_sm80_f16_s16816wgrad3d_optimized_f16_static [ 93%] Built target cutlass_library_conv3d_sm80_h16816dgrad3d_optimized_static [ 93%] Built target cutlass_library_conv3d_sm80_h16816dgrad3d_analytic_static [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_h16816fprop3d_optimized.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_h16816wgrad3d_optimized.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_s16816dgrad3d_analytic_f16.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_s16816dgrad3d_analytic_bf16.a [ 93%] Built target cutlass_library_conv3d_sm80_h16816fprop3d_optimized_static [ 93%] Built target cutlass_library_conv3d_sm80_h16816wgrad3d_optimized_static [ 93%] Built target cutlass_library_conv3d_sm80_s16816dgrad3d_analytic_bf16_static [ 93%] Built target cutlass_library_conv3d_sm80_s16816dgrad3d_analytic_f16_static [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_s16816dgrad3d_optimized_f16.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_s16816dgrad3d_optimized_bf16.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_s16816fprop3d_optimized_bf16.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_s16816fprop3d_optimized_f16.a [ 93%] Built target cutlass_library_conv3d_sm80_s16816dgrad3d_optimized_bf16_static [ 93%] Built target cutlass_library_conv3d_sm80_s16816dgrad3d_optimized_f16_static [ 93%] Built target cutlass_library_conv3d_sm80_s16816fprop3d_optimized_f16_static [ 93%] Built target cutlass_library_conv3d_sm80_s16816fprop3d_optimized_bf16_static [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_s16816wgrad3d_optimized_bf16.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm80_s16816wgrad3d_optimized_f16.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm90_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm90_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32.a [ 93%] Built target cutlass_library_conv3d_sm80_s16816wgrad3d_optimized_bf16_static [ 93%] Built target cutlass_library_conv3d_sm80_s16816wgrad3d_optimized_f16_static [ 93%] Built target cutlass_library_conv3d_sm90_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32_static [ 93%] Built target cutlass_library_conv3d_sm90_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32_static [ 93%] Linking CUDA static library libcutlass_conv3d_sm90_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32.a [ 93%] Linking CUDA static library libcutlass_conv3d_sm90_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32.a [ 93%] Linking CUDA static library libcutlass_rank_k_sm80_c1688syrk.a [ 93%] Linking CUDA static library libcutlass_rank_k_sm80_c1688herk.a [ 93%] Built target cutlass_library_conv3d_sm90_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32_static [ 93%] Built target cutlass_library_conv3d_sm90_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32_static [ 93%] Built target cutlass_library_rank_k_sm80_c1688syrk_static [ 93%] Built target cutlass_library_rank_k_sm80_c1688herk_static [ 93%] Linking CUDA static library libcutlass_rank_k_sm80_c1688tf32herk.a [ 93%] Linking CUDA static library libcutlass_rank_k_sm80_c1688tf32syrk.a [ 93%] Linking CUDA static library libcutlass_rank_k_sm80_d884syrk.a [ 93%] Linking CUDA static library libcutlass_rank_k_sm80_gz884herk.a [ 93%] Built target cutlass_library_rank_k_sm80_c1688tf32herk_static [ 93%] Built target cutlass_library_rank_k_sm80_gz884herk_static [ 93%] Built target cutlass_library_rank_k_sm80_d884syrk_static [ 93%] Linking CUDA static library libcutlass_rank_k_sm80_gz884syrk.a [ 93%] Linking CUDA static library libcutlass_rank_k_sm80_s1688syrk.a [ 93%] Linking CUDA static library libcutlass_rank_k_sm80_s1688tf32syrk.a [ 93%] Built target cutlass_library_rank_k_sm80_gz884syrk_static [ 93%] Built target cutlass_library_rank_k_sm80_c1688tf32syrk_static [ 93%] Built target cutlass_library_rank_k_sm80_s1688syrk_static [ 93%] Linking CUDA static library libcutlass_rank_k_sm80_z884syrk.a [ 93%] Linking CUDA static library libcutlass_rank_k_sm90_d1684syrk.a [ 93%] Built target cutlass_library_rank_k_sm80_s1688tf32syrk_static [ 93%] Linking CUDA static library libcutlass_rank_k_sm90_gz1684herk.a [ 93%] Linking CUDA static library libcutlass_rank_k_sm90_gz1684syrk.a [ 93%] Built target cutlass_library_rank_k_sm80_z884syrk_static [ 93%] Built target cutlass_library_rank_k_sm90_d1684syrk_static [ 93%] Built target cutlass_library_rank_k_sm90_gz1684herk_static [ 93%] Linking CUDA static library libcutlass_rank_k_sm90_z1684herk.a [ 93%] Linking CUDA static library libcutlass_rank_k_sm90_z1684syrk.a [ 93%] Built target cutlass_library_rank_k_sm90_gz1684syrk_static [ 93%] Linking CUDA static library libcutlass_rank_2k_sm80_c1688her2k.a [ 93%] Linking CUDA static library libcutlass_rank_2k_sm80_c1688syr2k.a [ 93%] Built target cutlass_library_rank_k_sm90_z1684herk_static [ 93%] Built target cutlass_library_rank_k_sm90_z1684syrk_static [ 93%] Linking CUDA static library libcutlass_rank_2k_sm80_c1688tf32her2k.a [ 93%] Built target cutlass_library_rank_2k_sm80_c1688her2k_static [ 93%] Built target cutlass_library_rank_2k_sm80_c1688syr2k_static [ 93%] Linking CUDA static library libcutlass_rank_2k_sm80_c1688tf32syr2k.a [ 93%] Linking CUDA static library libcutlass_rank_2k_sm80_d884syr2k.a [ 93%] Linking CUDA static library libcutlass_rank_2k_sm80_gz884her2k.a [ 93%] Built target cutlass_library_rank_2k_sm80_c1688tf32her2k_static [ 93%] Built target cutlass_library_rank_2k_sm80_c1688tf32syr2k_static [ 93%] Linking CUDA static library libcutlass_rank_2k_sm80_gz884syr2k.a [ 93%] Built target cutlass_library_rank_2k_sm80_gz884her2k_static [ 93%] Linking CUDA static library libcutlass_rank_2k_sm80_s1688syr2k.a [ 93%] Built target cutlass_library_rank_2k_sm80_d884syr2k_static [ 93%] Linking CUDA static library libcutlass_rank_2k_sm80_s1688tf32syr2k.a [ 93%] Linking CUDA static library libcutlass_rank_2k_sm80_z884her2k.a [ 93%] Built target cutlass_library_rank_2k_sm80_gz884syr2k_static [ 93%] Built target cutlass_library_rank_2k_sm80_s1688syr2k_static [ 93%] Linking CUDA static library libcutlass_rank_2k_sm80_z884syr2k.a [ 93%] Built target cutlass_library_rank_2k_sm80_z884her2k_static [ 93%] Built target cutlass_library_rank_2k_sm80_s1688tf32syr2k_static [ 93%] Linking CUDA static library libcutlass_rank_2k_sm90_d1684syr2k.a [ 93%] Linking CUDA static library libcutlass_rank_2k_sm90_gz1684her2k.a [ 93%] Linking CUDA static library libcutlass_rank_2k_sm90_gz1684syr2k.a [ 93%] Built target cutlass_library_rank_2k_sm80_z884syr2k_static [ 93%] Built target cutlass_library_rank_2k_sm90_d1684syr2k_static [ 93%] Linking CUDA static library libcutlass_rank_2k_sm90_z1684her2k.a [ 93%] Built target cutlass_library_rank_2k_sm90_gz1684syr2k_static [ 93%] Built target cutlass_library_rank_2k_sm90_gz1684her2k_static [ 93%] Linking CUDA static library libcutlass_rank_2k_sm90_z1684syr2k.a [ 93%] Linking CUDA static library libcutlass_trmm_sm80_c1688trmm.a [ 93%] Linking CUDA static library libcutlass_trmm_sm80_c1688tf32trmm.a [ 93%] Built target cutlass_library_rank_2k_sm90_z1684her2k_static [ 93%] Built target cutlass_library_rank_2k_sm90_z1684syr2k_static [ 93%] Linking CUDA static library libcutlass_trmm_sm80_d884trmm.a [ 93%] Linking CUDA static library libcutlass_trmm_sm80_gz884trmm.a [ 93%] Built target cutlass_library_trmm_sm80_c1688tf32trmm_static [ 93%] Built target cutlass_library_trmm_sm80_c1688trmm_static [ 93%] Built target cutlass_library_trmm_sm80_d884trmm_static [ 93%] Linking CUDA static library libcutlass_trmm_sm80_s1688tf32trmm.a [ 93%] Linking CUDA static library libcutlass_trmm_sm80_s1688trmm.a [ 93%] Built target cutlass_library_trmm_sm80_gz884trmm_static [ 93%] Linking CUDA static library libcutlass_trmm_sm80_z884trmm.a [ 93%] Linking CUDA static library libcutlass_trmm_sm90_d1684trmm.a [ 93%] Built target cutlass_library_trmm_sm80_s1688tf32trmm_static [ 93%] Built target cutlass_library_trmm_sm80_s1688trmm_static [ 94%] Linking CUDA static library libcutlass_trmm_sm90_gz1684trmm.a [ 94%] Built target cutlass_library_trmm_sm80_z884trmm_static [ 94%] Linking CUDA static library libcutlass_trmm_sm90_z1684trmm.a [ 94%] Built target cutlass_library_trmm_sm90_d1684trmm_static [ 94%] Linking CUDA static library libcutlass_symm_sm80_c1688hemm.a [ 94%] Linking CUDA static library libcutlass_symm_sm80_c1688symm.a [ 94%] Built target cutlass_library_trmm_sm90_gz1684trmm_static [ 94%] Built target cutlass_library_symm_sm80_c1688hemm_static [ 94%] Built target cutlass_library_trmm_sm90_z1684trmm_static [ 94%] Built target cutlass_library_symm_sm80_c1688symm_static [ 94%] Linking CUDA static library libcutlass_symm_sm80_c1688tf32hemm.a [ 94%] Linking CUDA static library libcutlass_symm_sm80_c1688tf32symm.a [ 94%] Linking CUDA static library libcutlass_symm_sm80_d884symm.a [ 94%] Linking CUDA static library libcutlass_symm_sm80_gz884hemm.a [ 94%] Built target cutlass_library_symm_sm80_c1688tf32hemm_static [ 94%] Built target cutlass_library_symm_sm80_c1688tf32symm_static [ 94%] Built target cutlass_library_symm_sm80_d884symm_static [ 94%] Built target cutlass_library_symm_sm80_gz884hemm_static [ 94%] Linking CUDA static library libcutlass_symm_sm80_gz884symm.a [ 94%] Linking CUDA static library libcutlass_symm_sm80_s1688symm.a [ 94%] Linking CUDA static library libcutlass_symm_sm80_s1688tf32symm.a [ 94%] Linking CUDA static library libcutlass_symm_sm80_z884hemm.a [ 94%] Built target cutlass_library_symm_sm80_gz884symm_static [ 94%] Built target cutlass_library_symm_sm80_s1688tf32symm_static [ 94%] Built target cutlass_library_symm_sm80_s1688symm_static [ 94%] Built target cutlass_library_symm_sm80_z884hemm_static [ 94%] Linking CUDA static library libcutlass_symm_sm80_z884symm.a [ 94%] Linking CUDA static library libcutlass_symm_sm90_d1684symm.a [ 94%] Linking CUDA static library libcutlass_symm_sm90_gz1684hemm.a [ 94%] Linking CUDA static library libcutlass_symm_sm90_gz1684symm.a [ 94%] Built target cutlass_library_symm_sm80_z884symm_static [ 94%] Built target cutlass_library_symm_sm90_gz1684hemm_static [ 94%] Built target cutlass_library_symm_sm90_d1684symm_static [ 94%] Built target cutlass_library_symm_sm90_gz1684symm_static [ 94%] Linking CUDA static library libcutlass_symm_sm90_z1684hemm.a [ 94%] Linking CUDA shared library libcutlass_gemm_sm50_cgemm.so [ 94%] Linking CUDA shared library libcutlass_symm_sm90_z1684symm.so [ 94%] Linking CUDA shared library libcutlass_gemm_sm50_dgemm.so [ 94%] Built target cutlass_library_symm_sm90_z1684hemm_static [ 94%] Linking CUDA shared library libcutlass_gemm_sm50_sgemm.so [ 94%] Built target cutlass_library_gemm_sm50_dgemm [ 94%] Built target cutlass_library_symm_sm90_z1684symm [ 94%] Built target cutlass_library_gemm_sm50_sgemm [ 94%] Built target cutlass_library_gemm_sm50_cgemm [ 94%] Linking CUDA shared library libcutlass_gemm_sm70_f16_s884gemm_f16.so [ 94%] Linking CUDA shared library libcutlass_gemm_sm61_igemm_s8.so [ 94%] Linking CUDA shared library libcutlass_gemm_sm61_s8_igemm_s8.so [ 94%] Linking CUDA shared library libcutlass_gemm_sm60_hgemm.so [ 94%] Built target cutlass_library_gemm_sm61_s8_igemm_s8 [ 94%] Built target cutlass_library_gemm_sm70_f16_s884gemm_f16 [ 94%] Built target cutlass_library_gemm_sm61_igemm_s8 [ 94%] Built target cutlass_library_gemm_sm60_hgemm [ 94%] Linking CUDA shared library libcutlass_gemm_sm70_f16_s884gemm_planar_complex_array_f16.so [ 94%] Linking CUDA shared library libcutlass_gemm_sm70_h884gemm.so [ 94%] Linking CUDA shared library libcutlass_gemm_sm70_f16_s884gemm_planar_complex_f16.so [ 94%] Linking CUDA shared library libcutlass_gemm_sm70_h884gemm_planar_complex.so [ 94%] Built target cutlass_library_gemm_sm70_h884gemm [ 94%] Built target cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_array_f16 [ 94%] Built target cutlass_library_gemm_sm70_f16_s884gemm_planar_complex_f16 [ 94%] Built target cutlass_library_gemm_sm70_h884gemm_planar_complex [ 94%] Linking CUDA shared library libcutlass_gemm_sm70_h884gemm_planar_complex_array.so [ 94%] Linking CUDA shared library libcutlass_gemm_sm70_s884gemm_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm70_s884gemm_planar_complex_array_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm70_s884gemm_planar_complex_f16.so [ 95%] Built target cutlass_library_gemm_sm70_s884gemm_f16 [ 95%] Built target cutlass_library_gemm_sm70_h884gemm_planar_complex_array [ 95%] Built target cutlass_library_gemm_sm70_s884gemm_planar_complex_array_f16 [ 95%] Built target cutlass_library_gemm_sm70_s884gemm_planar_complex_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_f16_s1688gemm_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_f16_s1688gemm_planar_complex_array_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_f16_s1688gemm_planar_complex_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_h1688gemm.so [ 95%] Built target cutlass_library_gemm_sm75_f16_s1688gemm_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_h1688gemm_planar_complex.so [ 95%] Built target cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_array_f16 [ 95%] Built target cutlass_library_gemm_sm75_h1688gemm [ 95%] Built target cutlass_library_gemm_sm75_f16_s1688gemm_planar_complex_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_h1688gemm_planar_complex_array.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_i88128xorgemm_b1.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_i8816gemm_s8.so [ 95%] Built target cutlass_library_gemm_sm75_h1688gemm_planar_complex [ 95%] Built target cutlass_library_gemm_sm75_i88128xorgemm_b1 [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_i8816gemm_u8.so [ 95%] Built target cutlass_library_gemm_sm75_i8816gemm_s8 [ 95%] Built target cutlass_library_gemm_sm75_h1688gemm_planar_complex_array [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_i8832gemm_s4.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_i8832gemm_u4.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_s1688gemm_f16.so [ 95%] Built target cutlass_library_gemm_sm75_i8816gemm_u8 [ 95%] Built target cutlass_library_gemm_sm75_i8832gemm_s4 [ 95%] Built target cutlass_library_gemm_sm75_i8832gemm_u4 [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_s1688gemm_planar_complex_array_f16.so [ 95%] Built target cutlass_library_gemm_sm75_s1688gemm_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_s1688gemm_planar_complex_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_s4_i8832gemm_s4.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_s8_i8816gemm_s8.so [ 95%] Built target cutlass_library_gemm_sm75_s1688gemm_planar_complex_array_f16 [ 95%] Built target cutlass_library_gemm_sm75_s4_i8832gemm_s4 [ 95%] Built target cutlass_library_gemm_sm75_s8_i8816gemm_s8 [ 95%] Built target cutlass_library_gemm_sm75_s1688gemm_planar_complex_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_u4_i8832gemm_u4.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm75_u8_i8816gemm_u8.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_bf16_s16816gemm_bf16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_bf16_s16816gemm_bf16_s8.so [ 95%] Built target cutlass_library_gemm_sm75_u4_i8832gemm_u4 [ 95%] Built target cutlass_library_gemm_sm75_u8_i8816gemm_u8 [ 95%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_bf16 [ 95%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_s8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_bf16_s16816gemm_bf16_u8.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_bf16_s16816gemm_s8_bf16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_bf16_s16816gemm_planar_complex_bf16.so [ 95%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_bf16_u8 [ 95%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_s8_bf16 [ 95%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_bf16_s16816gemm_u8_bf16.so [ 95%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_planar_complex_bf16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_bf16_s16832spgemm_bf16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_c1688gemm.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_c1688tf32gemm.so [ 95%] Built target cutlass_library_gemm_sm80_bf16_s16816gemm_u8_bf16 [ 95%] Built target cutlass_library_gemm_sm80_bf16_s16832spgemm_bf16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_cgemm.so [ 95%] Built target cutlass_library_gemm_sm80_c1688gemm [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_d884gemm.so [ 95%] Built target cutlass_library_gemm_sm80_c1688tf32gemm [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_dgemm.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_f16_s16816gemm_f16.so [ 95%] Built target cutlass_library_gemm_sm80_d884gemm [ 95%] Built target cutlass_library_gemm_sm80_cgemm [ 95%] Built target cutlass_library_gemm_sm80_dgemm [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_f16_s16816gemm_f16_s8.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_f16_s16816gemm_f16_u8.so [ 95%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_f16_s16816gemm_planar_complex_array_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_f16_s16816gemm_planar_complex_f16.so [ 95%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_f16_u8 [ 95%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_f16_s8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_f16_s16816gemm_u8_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_f16_s16816gemm_s8_f16.so [ 95%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_array_f16 [ 95%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_planar_complex_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_f16_s16832spgemm_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_gz884gemm.so [ 95%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_s8_f16 [ 95%] Built target cutlass_library_gemm_sm80_f16_s16816gemm_u8_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_h16816gemm.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_h16816gemm_f16_s8.so [ 95%] Built target cutlass_library_gemm_sm80_f16_s16832spgemm_f16 [ 95%] Built target cutlass_library_gemm_sm80_gz884gemm [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_h16816gemm_f16_u8.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_h16816gemm_grouped.so [ 95%] Built target cutlass_library_gemm_sm80_h16816gemm [ 95%] Built target cutlass_library_gemm_sm80_h16816gemm_f16_s8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_h16816gemm_planar_complex.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_h16816gemm_planar_complex_array.so [ 95%] Built target cutlass_library_gemm_sm80_h16816gemm_f16_u8 [ 95%] Built target cutlass_library_gemm_sm80_h16816gemm_grouped [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_h16816gemm_s8_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_h16816gemm_u8_f16.so [ 95%] Built target cutlass_library_gemm_sm80_h16816gemm_planar_complex_array [ 95%] Built target cutlass_library_gemm_sm80_h16816gemm_planar_complex [ 95%] Built target cutlass_library_gemm_sm80_h16816gemm_s8_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_i168128spgemm_s4.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_h16832spgemm.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_i168256andgemm_b1.so [ 95%] Built target cutlass_library_gemm_sm80_h16816gemm_u8_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_i168256xorgemm_b1.so [ 95%] Built target cutlass_library_gemm_sm80_i168128spgemm_s4 [ 95%] Built target cutlass_library_gemm_sm80_h16832spgemm [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_i16832gemm_s4_s8.so [ 95%] Built target cutlass_library_gemm_sm80_i168256andgemm_b1 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_i16832gemm_s8.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_i16832gemm_s8_s4.so [ 95%] Built target cutlass_library_gemm_sm80_i168256xorgemm_b1 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_i16832gemm_u8.so [ 95%] Built target cutlass_library_gemm_sm80_i16832gemm_s4_s8 [ 95%] Built target cutlass_library_gemm_sm80_i16832gemm_s8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_i16864gemm_s4.so [ 95%] Built target cutlass_library_gemm_sm80_i16832gemm_s8_s4 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_i16864gemm_u4.so [ 95%] Built target cutlass_library_gemm_sm80_i16832gemm_u8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_i16864spgemm_s8.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_bf16.so [ 95%] Built target cutlass_library_gemm_sm80_i16864gemm_s4 [ 95%] Built target cutlass_library_gemm_sm80_i16864gemm_u4 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_bf16_s8.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_bf16_u8.so [ 95%] Built target cutlass_library_gemm_sm80_i16864spgemm_s8 [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_bf16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_f16_s8.so [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_bf16_s8 [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_bf16_u8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_f16_u8.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_grouped_bf16.so [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_f16 [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_f16_s8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_grouped_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_planar_complex_array_bf16.so [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_f16_u8 [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_grouped_bf16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_planar_complex_array_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_i64x128x64spgemm_s8.so [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_grouped_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_planar_complex_bf16.so [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_bf16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_planar_complex_f16.so [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_planar_complex_array_f16 [ 95%] Built target cutlass_library_gemm_sm90_void_i64x128x64spgemm_s8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_s8_bf16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_s8_f16.so [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_planar_complex_bf16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_u8_bf16.so [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_planar_complex_f16 [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_s8_bf16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816gemm_u8_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16816tf32spgemm.so [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_s8_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16832spgemm_bf16.so [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_u8_bf16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s16832spgemm_f16.so [ 95%] Built target cutlass_library_gemm_sm80_s16816gemm_u8_f16 [ 95%] Built target cutlass_library_gemm_sm80_s16816tf32spgemm [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s1688bf16gemm.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s1688f16gemm.so [ 95%] Built target cutlass_library_gemm_sm80_s16832spgemm_bf16 [ 95%] Built target cutlass_library_gemm_sm80_s16832spgemm_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s1688gemm.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s1688gemm_tf32.so [ 95%] Built target cutlass_library_gemm_sm80_s1688bf16gemm [ 95%] Built target cutlass_library_gemm_sm80_s1688f16gemm [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s1688tf32gemm.so [ 95%] Built target cutlass_library_gemm_sm80_s1688gemm [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s4_i168128spgemm_s4.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s4_i16864gemm_s4.so [ 95%] Built target cutlass_library_gemm_sm80_s1688gemm_tf32 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s8_i16832gemm_s4_s8.so [ 95%] Built target cutlass_library_gemm_sm80_s1688tf32gemm [ 95%] Built target cutlass_library_gemm_sm80_s4_i168128spgemm_s4 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s8_i16832gemm_s8.so [ 95%] Built target cutlass_library_gemm_sm80_s4_i16864gemm_s4 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s8_i16832gemm_s8_s4.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_s8_i16864spgemm_s8.so [ 95%] Built target cutlass_library_gemm_sm80_s8_i16832gemm_s4_s8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_sgemm.so [ 95%] Built target cutlass_library_gemm_sm80_s8_i16832gemm_s8 [ 95%] Built target cutlass_library_gemm_sm80_s8_i16832gemm_s8_s4 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_tf32_s1688gemm_tf32.so [ 95%] Built target cutlass_library_gemm_sm80_s8_i16864spgemm_s8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_u4_i16864gemm_u4.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_u8_i16832gemm_u8.so [ 95%] Built target cutlass_library_gemm_sm80_sgemm [ 95%] Linking CUDA shared library libcutlass_gemm_sm80_z884gemm.so [ 95%] Built target cutlass_library_gemm_sm80_tf32_s1688gemm_tf32 [ 95%] Built target cutlass_library_gemm_sm80_u4_i16864gemm_u4 [ 95%] Linking CUDA shared library libcutlass_gemm_sm89_s16864fastaccumspgemm_e4m3.so [ 95%] Built target cutlass_library_gemm_sm80_u8_i16832gemm_u8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm89_s16864fastaccumspgemm_e4m3_e5m2.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm89_s16864fastaccumspgemm_e5m2.so [ 95%] Built target cutlass_library_gemm_sm80_z884gemm [ 95%] Linking CUDA shared library libcutlass_gemm_sm89_s16864fastaccumspgemm_e5m2_e4m3.so [ 95%] Built target cutlass_library_gemm_sm89_s16864fastaccumspgemm_e4m3 [ 95%] Built target cutlass_library_gemm_sm89_s16864fastaccumspgemm_e4m3_e5m2 [ 95%] Linking CUDA shared library libcutlass_gemm_sm89_s16864spgemm_e4m3.so [ 95%] Built target cutlass_library_gemm_sm89_s16864fastaccumspgemm_e5m2 [ 95%] Linking CUDA shared library libcutlass_gemm_sm89_s16864spgemm_e4m3_e5m2.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm89_s16864spgemm_e5m2.so [ 95%] Built target cutlass_library_gemm_sm89_s16864fastaccumspgemm_e5m2_e4m3 [ 95%] Built target cutlass_library_gemm_sm89_s16864spgemm_e4m3 [ 95%] Linking CUDA shared library libcutlass_gemm_sm89_s16864spgemm_e5m2_e4m3.so [ 95%] Built target cutlass_library_gemm_sm89_s16864spgemm_e4m3_e5m2 [ 95%] Built target cutlass_library_gemm_sm89_s16864spgemm_e5m2 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_bf16_s64x128x16gemm_bf16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_bf16_s64x128x32gemm_e4m3.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2.so [ 95%] Built target cutlass_library_gemm_sm89_s16864spgemm_e5m2_e4m3 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_bf16_s64x128x32gemm_e5m2.so [ 95%] Built target cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3 [ 95%] Built target cutlass_library_gemm_sm90_bf16_s64x128x16gemm_bf16 [ 95%] Built target cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_bf16_s64x128x32spgemm_bf16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e4m3.so [ 95%] Built target cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2.so [ 95%] Built target cutlass_library_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e5m2.so [ 95%] Built target cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3 [ 95%] Built target cutlass_library_gemm_sm90_bf16_s64x128x32spgemm_bf16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3.so [ 95%] Built target cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_d1684gemm.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_f16_s64x128x16gemm_f16.so [ 95%] Built target cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2 [ 95%] Built target cutlass_library_gemm_sm90_d1684gemm [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_f16_s64x128x32gemm_e4m3.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2.so [ 95%] Built target cutlass_library_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_f16_s64x128x32gemm_e5m2.so [ 95%] Built target cutlass_library_gemm_sm90_f16_s64x128x16gemm_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3.so [ 95%] Built target cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2 [ 95%] Built target cutlass_library_gemm_sm90_f16_s64x128x32gemm_e4m3 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_f16_s64x128x32spgemm_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_f16_s64x128x64spgemm_e4m3.so [ 95%] Built target cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2.so [ 95%] Built target cutlass_library_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_f16_s64x128x64spgemm_e5m2.so [ 95%] Built target cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3 [ 95%] Built target cutlass_library_gemm_sm90_f16_s64x128x32spgemm_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3.so [ 95%] Built target cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_gz1684gemm.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_h64x128x16gemm.so [ 95%] Built target cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_h64x128x32spgemm.so [ 95%] Built target cutlass_library_gemm_sm90_gz1684gemm [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_i64x128x32gemm_s8.so [ 95%] Built target cutlass_library_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_i64x128x32gemm_u8.so [ 95%] Built target cutlass_library_gemm_sm90_h64x128x16gemm [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_i64x128x64spgemm_s8.so [ 95%] Built target cutlass_library_gemm_sm90_i64x128x32gemm_s8 [ 95%] Built target cutlass_library_gemm_sm90_h64x128x32spgemm [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_i64x128x64spgemm_u8.so [ 95%] Built target cutlass_library_gemm_sm90_i64x128x32gemm_u8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x16gemm_bf16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x16gemm_f16.so [ 95%] Built target cutlass_library_gemm_sm90_i64x128x64spgemm_s8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x16spgemm_tf32.so [ 95%] Built target cutlass_library_gemm_sm90_i64x128x64spgemm_u8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x16tf32spgemm.so [ 95%] Built target cutlass_library_gemm_sm90_s64x128x16gemm_bf16 [ 95%] Built target cutlass_library_gemm_sm90_s64x128x16gemm_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x32gemm_e4m3.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x32gemm_e4m3_e5m2.so [ 95%] Built target cutlass_library_gemm_sm90_s64x128x16spgemm_tf32 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x32gemm_e5m2.so [ 95%] Built target cutlass_library_gemm_sm90_s64x128x16tf32spgemm [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x32gemm_e5m2_e4m3.so [ 95%] Built target cutlass_library_gemm_sm90_s64x128x32gemm_e4m3 [ 95%] Built target cutlass_library_gemm_sm90_s64x128x32gemm_e4m3_e5m2 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x32spgemm_bf16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x32spgemm_f16.so [ 95%] Built target cutlass_library_gemm_sm90_s64x128x32gemm_e5m2 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x64spgemm_e4m3.so [ 95%] Built target cutlass_library_gemm_sm90_s64x128x32gemm_e5m2_e4m3 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x64spgemm_e4m3_e5m2.so [ 95%] Built target cutlass_library_gemm_sm90_s64x128x32spgemm_bf16 [ 95%] Built target cutlass_library_gemm_sm90_s64x128x32spgemm_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x64spgemm_e5m2.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x64spgemm_e5m2_e4m3.so [ 95%] Built target cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x8gemm_tf32.so [ 95%] Built target cutlass_library_gemm_sm90_s64x128x64spgemm_e4m3_e5m2 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s64x128x8tf32gemm.so [ 95%] Built target cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2 [ 95%] Built target cutlass_library_gemm_sm90_s64x128x64spgemm_e5m2_e4m3 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s8_i64x128x32gemm_s8.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s8_i64x128x32gemm_u8.so [ 95%] Built target cutlass_library_gemm_sm90_s64x128x8gemm_tf32 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s8_i64x128x64spgemm_s8.so [ 95%] Built target cutlass_library_gemm_sm90_s64x128x8tf32gemm [ 95%] Built target cutlass_library_gemm_sm90_s8_i64x128x32gemm_s8 [ 95%] Built target cutlass_library_gemm_sm90_s8_i64x128x32gemm_u8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_s8_i64x128x64spgemm_u8.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_h64x128x16gemm.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_h64x128x32spgemm.so [ 95%] Built target cutlass_library_gemm_sm90_s8_i64x128x64spgemm_s8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_i64x128x32gemm_s8.so [ 95%] Built target cutlass_library_gemm_sm90_void_h64x128x16gemm [ 95%] Built target cutlass_library_gemm_sm90_s8_i64x128x64spgemm_u8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_i64x128x32gemm_u8.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_i64x128x64spgemm_u8.so [ 95%] Built target cutlass_library_gemm_sm90_void_h64x128x32spgemm [ 95%] Built target cutlass_library_gemm_sm90_void_i64x128x32gemm_s8 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_s64x128x16gemm_bf16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_s64x128x16gemm_f16.so [ 95%] Built target cutlass_library_gemm_sm90_void_i64x128x32gemm_u8 [ 95%] Built target cutlass_library_gemm_sm90_void_i64x128x64spgemm_u8 [ 95%] Linking CUDA shared library libcutlass_rank_k_sm80_c1688tf32syrk.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_s64x128x32gemm_e4m3.so [ 95%] Built target cutlass_library_rank_k_sm80_c1688tf32syrk [ 95%] Built target cutlass_library_gemm_sm90_void_s64x128x16gemm_bf16 [ 95%] Built target cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3 [ 95%] Built target cutlass_library_gemm_sm90_void_s64x128x16gemm_f16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_s64x128x32gemm_e4m3_e5m2.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_s64x128x32gemm_e5m2.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_s64x128x32gemm_e5m2_e4m3.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_s64x128x32spgemm_bf16.so [ 95%] Built target cutlass_library_gemm_sm90_void_s64x128x32gemm_e4m3_e5m2 [ 95%] Built target cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2 [ 95%] Built target cutlass_library_gemm_sm90_void_s64x128x32gemm_e5m2_e4m3 [ 95%] Linking CUDA shared library libcutlass_rank_k_sm80_z884herk.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_s64x128x32spgemm_f16.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_s64x128x64spgemm_e4m3.so [ 95%] Built target cutlass_library_rank_k_sm80_z884herk [ 95%] Built target cutlass_library_gemm_sm90_void_s64x128x64spgemm_e4m3 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_s64x128x64spgemm_e4m3_e5m2.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_s64x128x64spgemm_e5m2.so [ 95%] Built target cutlass_library_gemm_sm90_void_s64x128x32spgemm_bf16 [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_void_s64x128x64spgemm_e5m2_e4m3.so [ 95%] Built target cutlass_library_gemm_sm90_void_s64x128x64spgemm_e4m3_e5m2 [ 95%] Built target cutlass_library_gemm_sm90_void_s64x128x64spgemm_e5m2 [ 95%] Built target cutlass_library_gemm_sm90_void_s64x128x32spgemm_f16 [ 95%] Linking CUDA shared library libcutlass_conv2d_sm50_cf32_cdgrad_optimized_cf32.so [ 95%] Linking CUDA shared library libcutlass_gemm_sm90_z1684gemm.so [ 95%] Built target cutlass_library_gemm_sm90_void_s64x128x64spgemm_e5m2_e4m3 [ 95%] Linking CUDA shared library libcutlass_conv2d_sm50_cf32_cfprop_optimized_cf32.so [ 95%] Linking CUDA shared library libcutlass_conv2d_sm50_cf32_cwgrad_optimized_cf32.so [ 95%] Built target cutlass_library_conv2d_sm50_cf32_cdgrad_optimized_cf32 [ 95%] Built target cutlass_library_conv2d_sm50_cf32_cfprop_optimized_cf32 [ 95%] Built target cutlass_library_gemm_sm90_z1684gemm [ 95%] Linking CUDA shared library libcutlass_conv2d_sm50_sdgrad_optimized.so [ 95%] Linking CUDA shared library libcutlass_conv2d_sm50_sfprop_optimized.so [ 95%] Built target cutlass_library_conv2d_sm50_cf32_cwgrad_optimized_cf32 [ 95%] Linking CUDA shared library libcutlass_conv2d_sm50_swgrad_optimized.so [ 95%] Linking CUDA shared library libcutlass_conv2d_sm60_hfprop_optimized.so [ 95%] Built target cutlass_library_conv2d_sm50_sdgrad_optimized [ 95%] Built target cutlass_library_conv2d_sm50_sfprop_optimized [ 95%] Built target cutlass_library_conv2d_sm50_swgrad_optimized [ 95%] Linking CUDA shared library libcutlass_conv2d_sm70_f16_s884dgrad_optimized_f16.so [ 95%] Linking CUDA shared library libcutlass_conv2d_sm70_f16_s884fprop_optimized_f16.so [ 95%] Built target cutlass_library_conv2d_sm60_hfprop_optimized [ 95%] Linking CUDA shared library libcutlass_conv2d_sm70_f16_s884wgrad_optimized_f16.so [ 95%] Linking CUDA shared library libcutlass_conv2d_sm70_h884dgrad_optimized.so [ 95%] Built target cutlass_library_conv2d_sm70_f16_s884fprop_optimized_f16 [ 95%] Built target cutlass_library_conv2d_sm70_f16_s884dgrad_optimized_f16 [ 95%] Built target cutlass_library_conv2d_sm70_f16_s884wgrad_optimized_f16 [ 95%] Linking CUDA shared library libcutlass_conv2d_sm70_h884fprop_optimized.so [ 95%] Linking CUDA shared library libcutlass_conv2d_sm70_h884wgrad_optimized.so [ 95%] Linking CUDA shared library libcutlass_conv2d_sm70_s884dgrad_optimized_f16.so [ 95%] Built target cutlass_library_conv2d_sm70_h884dgrad_optimized [ 95%] Linking CUDA shared library libcutlass_conv2d_sm70_s884fprop_optimized_f16.so [ 95%] Built target cutlass_library_conv2d_sm70_h884wgrad_optimized [ 95%] Built target cutlass_library_conv2d_sm70_h884fprop_optimized [ 95%] Built target cutlass_library_conv2d_sm70_s884dgrad_optimized_f16 [ 95%] Linking CUDA shared library libcutlass_conv2d_sm70_s884wgrad_optimized_f16.so [ 95%] Linking CUDA shared library libcutlass_conv2d_sm75_cf32_cdgrad_optimized_cf32.so [ 95%] Linking CUDA shared library libcutlass_conv2d_sm75_cf32_cfprop_optimized_cf32.so [ 95%] Built target cutlass_library_conv2d_sm70_s884fprop_optimized_f16 [ 95%] Linking CUDA shared library libcutlass_conv2d_sm75_cf32_cwgrad_optimized_cf32.so [ 95%] Built target cutlass_library_conv2d_sm70_s884wgrad_optimized_f16 [ 95%] Built target cutlass_library_conv2d_sm75_cf32_cdgrad_optimized_cf32 [ 95%] Built target cutlass_library_conv2d_sm75_cf32_cfprop_optimized_cf32 [ 95%] Linking CUDA shared library libcutlass_conv2d_sm75_f16_s1688dgrad_optimized_f16.so [ 95%] Linking CUDA shared library libcutlass_conv2d_sm75_f16_s1688fprop_few_channels_f16.so [ 95%] Linking CUDA shared library libcutlass_conv2d_sm75_f16_s1688fprop_fixed_channels_f16.so [ 95%] Built target cutlass_library_conv2d_sm75_cf32_cwgrad_optimized_cf32 [ 95%] Linking CUDA shared library libcutlass_conv2d_sm75_f16_s1688fprop_optimized_f16.so [ 95%] Built target cutlass_library_conv2d_sm75_f16_s1688fprop_few_channels_f16 [ 95%] Built target cutlass_library_conv2d_sm75_f16_s1688fprop_fixed_channels_f16 [ 95%] Built target cutlass_library_conv2d_sm75_f16_s1688dgrad_optimized_f16 [ 95%] Linking CUDA shared library libcutlass_conv2d_sm75_f16_s1688wgrad_optimized_f16.so [ 95%] Linking CUDA shared library libcutlass_conv2d_sm75_h1688fprop_few_channels.so [ 95%] Linking CUDA shared library libcutlass_conv2d_sm75_h1688dgrad_optimized.so [ 95%] Built target cutlass_library_conv2d_sm75_f16_s1688fprop_optimized_f16 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_h1688fprop_fixed_channels.so [ 96%] Built target cutlass_library_conv2d_sm75_h1688fprop_few_channels [ 96%] Built target cutlass_library_conv2d_sm75_h1688dgrad_optimized [ 96%] Built target cutlass_library_conv2d_sm75_f16_s1688wgrad_optimized_f16 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_h1688wgrad_optimized.so [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_i8816fprop_optimized_s8.so [ 96%] Built target cutlass_library_conv2d_sm75_h1688fprop_fixed_channels [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_h1688fprop_optimized.so [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_i8816fprop_optimized_u8.so [ 96%] Built target cutlass_library_conv2d_sm75_h1688wgrad_optimized [ 96%] Built target cutlass_library_conv2d_sm75_i8816fprop_optimized_s8 [ 96%] Built target cutlass_library_conv2d_sm75_h1688fprop_optimized [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_i8832fprop_optimized_s4.so [ 96%] Built target cutlass_library_conv2d_sm75_i8816fprop_optimized_u8 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_i8832fprop_optimized_u4.so [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_s1688dgrad_optimized_f16.so [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_s1688fprop_few_channels_f16.so [ 96%] Built target cutlass_library_conv2d_sm75_i8832fprop_optimized_s4 [ 96%] Built target cutlass_library_conv2d_sm75_i8832fprop_optimized_u4 [ 96%] Built target cutlass_library_conv2d_sm75_s1688dgrad_optimized_f16 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_s1688fprop_fixed_channels_f16.so [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_s1688fprop_optimized_f16.so [ 96%] Built target cutlass_library_conv2d_sm75_s1688fprop_few_channels_f16 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_s1688wgrad_optimized_f16.so [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_s4_i8832fprop_optimized_s4.so [ 96%] Built target cutlass_library_conv2d_sm75_s1688fprop_fixed_channels_f16 [ 96%] Built target cutlass_library_conv2d_sm75_s1688fprop_optimized_f16 [ 96%] Built target cutlass_library_conv2d_sm75_s1688wgrad_optimized_f16 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_s8_i8816fprop_few_channels_s8.so [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_s8_i8816fprop_fixed_channels_s8.so [ 96%] Built target cutlass_library_conv2d_sm75_s4_i8832fprop_optimized_s4 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_s8_i8816fprop_optimized_s8.so [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_u4_i8832fprop_optimized_u4.so [ 96%] Built target cutlass_library_conv2d_sm75_s8_i8816fprop_few_channels_s8 [ 96%] Built target cutlass_library_conv2d_sm75_s8_i8816fprop_fixed_channels_s8 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_u8_i8816fprop_few_channels_u8.so [ 96%] Built target cutlass_library_conv2d_sm75_s8_i8816fprop_optimized_s8 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_u8_i8816fprop_fixed_channels_u8.so [ 96%] Built target cutlass_library_conv2d_sm75_u4_i8832fprop_optimized_u4 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm75_u8_i8816fprop_optimized_u8.so [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_bf16_s16816dgrad_optimized_bf16.so [ 96%] Built target cutlass_library_conv2d_sm75_u8_i8816fprop_few_channels_u8 [ 96%] Built target cutlass_library_conv2d_sm75_u8_i8816fprop_fixed_channels_u8 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_bf16_s16816fprop_fixed_channels_bf16.so [ 96%] Built target cutlass_library_conv2d_sm75_u8_i8816fprop_optimized_u8 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_bf16_s16816fprop_optimized_bf16.so [ 96%] Built target cutlass_library_conv2d_sm80_bf16_s16816dgrad_optimized_bf16 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_bf16_s16816wgrad_optimized_bf16.so [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_f16_s16816dgrad_optimized_f16.so [ 96%] Built target cutlass_library_conv2d_sm80_bf16_s16816fprop_fixed_channels_bf16 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_f16_s16816fprop_fixed_channels_f16.so [ 96%] Built target cutlass_library_conv2d_sm80_bf16_s16816fprop_optimized_bf16 [ 96%] Built target cutlass_library_conv2d_sm80_bf16_s16816wgrad_optimized_bf16 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_f16_s16816fprop_optimized_f16.so [ 96%] Built target cutlass_library_conv2d_sm80_f16_s16816dgrad_optimized_f16 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_f16_s16816wgrad_optimized_f16.so [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_h16816dgrad_optimized.so [ 96%] Built target cutlass_library_conv2d_sm80_f16_s16816fprop_fixed_channels_f16 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_h16816fprop_fixed_channels.so [ 96%] Built target cutlass_library_conv2d_sm80_f16_s16816wgrad_optimized_f16 [ 96%] Built target cutlass_library_conv2d_sm80_f16_s16816fprop_optimized_f16 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_h16816fprop_optimized.so [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_h16816wgrad_optimized.so [ 96%] Built target cutlass_library_conv2d_sm80_h16816dgrad_optimized [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_i16832fprop_optimized_s8.so [ 96%] Built target cutlass_library_conv2d_sm80_h16816fprop_fixed_channels [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_i16832fprop_optimized_u8.so [ 96%] Built target cutlass_library_conv2d_sm80_h16816wgrad_optimized [ 96%] Built target cutlass_library_conv2d_sm80_h16816fprop_optimized [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_i16864fprop_optimized_s4.so [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_i16864fprop_optimized_u4.so [ 96%] Built target cutlass_library_conv2d_sm80_i16832fprop_optimized_s8 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_s16816dgrad_optimized_bf16.so [ 96%] Built target cutlass_library_conv2d_sm80_i16832fprop_optimized_u8 [ 96%] Linking CUDA shared library libcutlass_conv2d_sm80_s16816dgrad_optimized_f16.so [ 96%] Built target cutlass_library_conv2d_sm80_i16864fprop_optimized_u4 [ 96%] Built target cutlass_library_conv2d_sm80_i16864fprop_optimized_s4 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s16816fprop_fixed_channels_bf16.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s16816fprop_fixed_channels_f16.so [ 97%] Built target cutlass_library_conv2d_sm80_s16816dgrad_optimized_bf16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s16816fprop_optimized_bf16.so [ 97%] Built target cutlass_library_conv2d_sm80_s16816dgrad_optimized_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s16816fprop_optimized_f16.so [ 97%] Built target cutlass_library_conv2d_sm80_s16816fprop_fixed_channels_f16 [ 97%] Built target cutlass_library_conv2d_sm80_s16816fprop_fixed_channels_bf16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s16816wgrad_optimized_bf16.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s16816wgrad_optimized_f16.so [ 97%] Built target cutlass_library_conv2d_sm80_s16816fprop_optimized_bf16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s1688bf16dgrad_optimized.so [ 97%] Built target cutlass_library_conv2d_sm80_s16816fprop_optimized_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s1688bf16fprop_optimized.so [ 97%] Built target cutlass_library_conv2d_sm80_s16816wgrad_optimized_f16 [ 97%] Built target cutlass_library_conv2d_sm80_s16816wgrad_optimized_bf16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s1688bf16wgrad_optimized.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s1688dgrad_optimized.so [ 97%] Built target cutlass_library_conv2d_sm80_s1688bf16dgrad_optimized [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s1688dgrad_optimized_tf32.so [ 97%] Built target cutlass_library_conv2d_sm80_s1688bf16fprop_optimized [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s1688f16dgrad_optimized.so [ 97%] Built target cutlass_library_conv2d_sm80_s1688bf16wgrad_optimized [ 97%] Built target cutlass_library_conv2d_sm80_s1688dgrad_optimized [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s1688f16fprop_optimized.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s1688f16wgrad_optimized.so [ 97%] Built target cutlass_library_conv2d_sm80_s1688dgrad_optimized_tf32 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s1688fprop_optimized.so [ 97%] Built target cutlass_library_conv2d_sm80_s1688f16dgrad_optimized [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s1688fprop_optimized_tf32.so [ 97%] Built target cutlass_library_conv2d_sm80_s1688f16wgrad_optimized [ 97%] Built target cutlass_library_conv2d_sm80_s1688f16fprop_optimized [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s1688tf32dgrad_optimized.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s1688tf32fprop_optimized.so [ 97%] Built target cutlass_library_conv2d_sm80_s1688fprop_optimized [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s1688tf32wgrad_optimized.so [ 97%] Built target cutlass_library_conv2d_sm80_s1688fprop_optimized_tf32 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s1688wgrad_optimized.so [ 97%] Built target cutlass_library_conv2d_sm80_s1688tf32dgrad_optimized [ 97%] Built target cutlass_library_conv2d_sm80_s1688tf32fprop_optimized [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s1688wgrad_optimized_tf32.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s4_i16864fprop_optimized_s4.so [ 97%] Built target cutlass_library_conv2d_sm80_s1688tf32wgrad_optimized [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s8_i16832fprop_few_channels_s8.so [ 97%] Built target cutlass_library_conv2d_sm80_s1688wgrad_optimized [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s8_i16832fprop_fixed_channels_s8.so [ 97%] Built target cutlass_library_conv2d_sm80_s1688wgrad_optimized_tf32 [ 97%] Built target cutlass_library_conv2d_sm80_s4_i16864fprop_optimized_s4 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_s8_i16832fprop_optimized_s8.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_sdgrad_optimized.so [ 97%] Built target cutlass_library_conv2d_sm80_s8_i16832fprop_few_channels_s8 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_sfprop_optimized.so [ 97%] Built target cutlass_library_conv2d_sm80_s8_i16832fprop_fixed_channels_s8 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_swgrad_optimized.so [ 97%] Built target cutlass_library_conv2d_sm80_sdgrad_optimized [ 97%] Built target cutlass_library_conv2d_sm80_s8_i16832fprop_optimized_s8 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_tf32_s1688dgrad_optimized_tf32.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_tf32_s1688fprop_optimized_tf32.so [ 97%] Built target cutlass_library_conv2d_sm80_sfprop_optimized [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_tf32_s1688wgrad_optimized_tf32.so [ 97%] Built target cutlass_library_conv2d_sm80_swgrad_optimized [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_u4_i16864fprop_optimized_u4.so [ 97%] Built target cutlass_library_conv2d_sm80_tf32_s1688fprop_optimized_tf32 [ 97%] Built target cutlass_library_conv2d_sm80_tf32_s1688dgrad_optimized_tf32 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_u8_i16832fprop_few_channels_u8.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_u8_i16832fprop_fixed_channels_u8.so [ 97%] Built target cutlass_library_conv2d_sm80_tf32_s1688wgrad_optimized_tf32 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm80_u8_i16832fprop_optimized_u8.so [ 97%] Built target cutlass_library_conv2d_sm80_u4_i16864fprop_optimized_u4 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm80_u8_i16832fprop_few_channels_u8 [ 97%] Built target cutlass_library_conv2d_sm80_u8_i16832fprop_fixed_channels_u8 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm80_u8_i16832fprop_optimized_u8 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Built target cutlass_library_conv2d_sm90_f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Built target cutlass_library_conv2d_sm90_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Built target cutlass_library_conv2d_sm90_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Built target cutlass_library_conv2d_sm90_f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Built target cutlass_library_conv2d_sm90_f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Built target cutlass_library_conv2d_sm90_f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 97%] Linking CUDA shared library libcutlass_conv2d_sm90_f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 97%] Built target cutlass_library_conv2d_sm90_f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16 [ 98%] Linking CUDA shared library libcutlass_conv2d_sm90_f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so [ 98%] Built target cutlass_library_conv2d_sm90_f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 98%] Built target cutlass_library_conv2d_sm90_f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16 [ 98%] Linking CUDA shared library libcutlass_conv2d_sm90_f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 98%] Linking CUDA shared library libcutlass_conv2d_sm90_f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so [ 98%] Built target cutlass_library_conv2d_sm90_f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 98%] Linking CUDA shared library libcutlass_conv2d_sm90_f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so [ 98%] Built target cutlass_library_conv2d_sm90_f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16 [ 98%] Linking CUDA shared library libcutlass_conv2d_sm90_f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so [ 98%] Built target cutlass_library_conv2d_sm90_f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 98%] Built target cutlass_library_conv2d_sm90_f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16 [ 98%] Linking CUDA shared library libcutlass_conv2d_sm90_f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so [ 98%] Built target cutlass_library_conv2d_sm90_f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16 [ 98%] Linking CUDA shared library libcutlass_conv2d_sm90_f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so [ 98%] Linking CUDA shared library libcutlass_conv2d_sm90_f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so [ 98%] Built target cutlass_library_conv2d_sm90_f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32 [ 98%] Linking CUDA shared library libcutlass_conv2d_sm90_f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so [ 98%] Built target cutlass_library_conv2d_sm90_f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32 [ 98%] Built target cutlass_library_conv2d_sm90_f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32 [ 98%] Built target cutlass_library_conv2d_sm90_f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32 [ 98%] Linking CUDA shared library libcutlass_conv2d_sm90_f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so [ 98%] Linking CUDA shared library libcutlass_conv2d_sm90_f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so [ 98%] Linking CUDA shared library libcutlass_conv2d_sm90_f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so [ 98%] Built target cutlass_library_conv2d_sm90_f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32 [ 98%] Built target cutlass_library_conv2d_sm90_f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32 [ 98%] Built target cutlass_library_conv2d_sm90_f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32 [ 98%] Linking CUDA shared library libcutlass_conv2d_sm90_f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so [ 98%] Built target cutlass_library_conv2d_sm90_f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32 [ 98%] Linking CUDA shared library libcutlass_conv2d_sm90_f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so [ 99%] Linking CUDA shared library libcutlass_conv2d_sm90_f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so [ 99%] Linking CUDA shared library libcutlass_conv2d_sm90_f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so [ 99%] Built target cutlass_library_conv2d_sm90_f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32 [ 99%] Built target cutlass_library_conv2d_sm90_f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32 [ 99%] Linking CUDA shared library libcutlass_conv2d_sm90_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32.so [ 99%] Built target cutlass_library_conv2d_sm90_f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32 [ 99%] Built target cutlass_library_conv2d_sm90_f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32 [ 99%] Linking CUDA shared library libcutlass_conv2d_sm90_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so [ 99%] Linking CUDA shared library libcutlass_conv2d_sm90_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32.so [ 99%] Linking CUDA shared library libcutlass_conv2d_sm90_s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32.so [ 99%] Built target cutlass_library_conv2d_sm90_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32 [ 99%] Built target cutlass_library_conv2d_sm90_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32 [ 99%] Linking CUDA shared library libcutlass_conv2d_sm90_s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32.so [ 99%] Built target cutlass_library_conv2d_sm90_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32 [ 99%] Built target cutlass_library_conv2d_sm90_s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32 [ 99%] Linking CUDA shared library libcutlass_conv2d_sm90_s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32.so [ 99%] Linking CUDA shared library libcutlass_conv2d_sm90_s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32.so [ 99%] Linking CUDA shared library libcutlass_conv2d_sm90_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32.so [ 99%] Built target cutlass_library_conv2d_sm90_s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32 [ 99%] Built target cutlass_library_conv2d_sm90_s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32 [ 99%] Built target cutlass_library_conv2d_sm90_s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32 [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_bf16_s16816dgrad3d_analytic_bf16.so [ 99%] Built target cutlass_library_conv2d_sm90_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32 [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_bf16_s16816dgrad3d_optimized_bf16.so [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_bf16_s16816fprop3d_optimized_bf16.so [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_bf16_s16816wgrad3d_optimized_bf16.so [ 99%] Built target cutlass_library_conv3d_sm80_bf16_s16816dgrad3d_analytic_bf16 [ 99%] Built target cutlass_library_conv3d_sm80_bf16_s16816dgrad3d_optimized_bf16 [ 99%] Built target cutlass_library_conv3d_sm80_bf16_s16816fprop3d_optimized_bf16 [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_f16_s16816dgrad3d_analytic_f16.so [ 99%] Built target cutlass_library_conv3d_sm80_bf16_s16816wgrad3d_optimized_bf16 [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_f16_s16816dgrad3d_optimized_f16.so [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_f16_s16816fprop3d_optimized_f16.so [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_f16_s16816wgrad3d_optimized_f16.so [ 99%] Built target cutlass_library_conv3d_sm80_f16_s16816dgrad3d_analytic_f16 [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_h16816dgrad3d_analytic.so [ 99%] Built target cutlass_library_conv3d_sm80_f16_s16816fprop3d_optimized_f16 [ 99%] Built target cutlass_library_conv3d_sm80_f16_s16816dgrad3d_optimized_f16 [ 99%] Built target cutlass_library_conv3d_sm80_f16_s16816wgrad3d_optimized_f16 [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_h16816dgrad3d_optimized.so [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_h16816fprop3d_optimized.so [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_h16816wgrad3d_optimized.so [ 99%] Built target cutlass_library_conv3d_sm80_h16816dgrad3d_analytic [ 99%] Built target cutlass_library_conv3d_sm80_h16816fprop3d_optimized [ 99%] Built target cutlass_library_conv3d_sm80_h16816dgrad3d_optimized [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_s16816dgrad3d_analytic_bf16.so [ 99%] Built target cutlass_library_conv3d_sm80_h16816wgrad3d_optimized [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_s16816dgrad3d_analytic_f16.so [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_s16816dgrad3d_optimized_bf16.so [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_s16816dgrad3d_optimized_f16.so [ 99%] Built target cutlass_library_conv3d_sm80_s16816dgrad3d_analytic_bf16 [ 99%] Built target cutlass_library_conv3d_sm80_s16816dgrad3d_analytic_f16 [ 99%] Built target cutlass_library_conv3d_sm80_s16816dgrad3d_optimized_bf16 [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_s16816fprop3d_optimized_bf16.so [ 99%] Built target cutlass_library_conv3d_sm80_s16816dgrad3d_optimized_f16 [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_s16816fprop3d_optimized_f16.so [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_s16816wgrad3d_optimized_bf16.so [ 99%] Linking CUDA shared library libcutlass_conv3d_sm80_s16816wgrad3d_optimized_f16.so [ 99%] Built target cutlass_library_conv3d_sm80_s16816fprop3d_optimized_bf16 [ 99%] Built target cutlass_library_conv3d_sm80_s16816wgrad3d_optimized_bf16 [ 99%] Built target cutlass_library_conv3d_sm80_s16816fprop3d_optimized_f16 [ 99%] Linking CUDA shared library libcutlass_conv3d_sm90_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32.so [ 99%] Built target cutlass_library_conv3d_sm80_s16816wgrad3d_optimized_f16 [ 99%] Linking CUDA shared library libcutlass_conv3d_sm90_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32.so [ 99%] Linking CUDA shared library libcutlass_conv3d_sm90_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32.so [ 99%] Linking CUDA shared library libcutlass_conv3d_sm90_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32.so [ 99%] Built target cutlass_library_conv3d_sm90_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32 [ 99%] Built target cutlass_library_conv3d_sm90_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32 [ 99%] Linking CUDA shared library libcutlass_rank_k_sm80_c1688herk.so [ 99%] Built target cutlass_library_conv3d_sm90_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32 [ 99%] Built target cutlass_library_conv3d_sm90_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32 [ 99%] Linking CUDA shared library libcutlass_rank_k_sm80_c1688syrk.so [ 99%] Linking CUDA shared library libcutlass_rank_k_sm80_c1688tf32herk.so [ 99%] Linking CUDA shared library libcutlass_rank_k_sm80_d884syrk.so [ 99%] Built target cutlass_library_rank_k_sm80_c1688herk [ 99%] Linking CUDA shared library libcutlass_rank_k_sm80_gz884herk.so [ 99%] Built target cutlass_library_rank_k_sm80_c1688tf32herk [ 99%] Built target cutlass_library_rank_k_sm80_c1688syrk [ 99%] Built target cutlass_library_rank_k_sm80_d884syrk [ 99%] Linking CUDA shared library libcutlass_rank_k_sm80_gz884syrk.so [ 99%] Linking CUDA shared library libcutlass_rank_k_sm80_s1688syrk.so [ 99%] Linking CUDA shared library libcutlass_rank_k_sm80_s1688tf32syrk.so [ 99%] Built target cutlass_library_rank_k_sm80_gz884herk [ 99%] Built target cutlass_library_rank_k_sm80_gz884syrk [ 99%] Linking CUDA shared library libcutlass_rank_k_sm80_z884syrk.so [ 99%] Built target cutlass_library_rank_k_sm80_s1688syrk [ 99%] Linking CUDA shared library libcutlass_rank_k_sm90_d1684syrk.so [ 99%] Built target cutlass_library_rank_k_sm80_s1688tf32syrk [ 99%] Linking CUDA shared library libcutlass_rank_k_sm90_gz1684herk.so [ 99%] Linking CUDA shared library libcutlass_rank_k_sm90_gz1684syrk.so [ 99%] Built target cutlass_library_rank_k_sm80_z884syrk [ 99%] Built target cutlass_library_rank_k_sm90_d1684syrk [ 99%] Linking CUDA shared library libcutlass_rank_k_sm90_z1684herk.so [ 99%] Built target cutlass_library_rank_k_sm90_gz1684herk [ 99%] Built target cutlass_library_rank_k_sm90_gz1684syrk [ 99%] Linking CUDA shared library libcutlass_rank_k_sm90_z1684syrk.so [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm80_c1688her2k.so [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm80_c1688syr2k.so [ 99%] Built target cutlass_library_rank_k_sm90_z1684herk [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm80_c1688tf32her2k.so [ 99%] Built target cutlass_library_rank_k_sm90_z1684syrk [ 99%] Built target cutlass_library_rank_2k_sm80_c1688her2k [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm80_c1688tf32syr2k.so [ 99%] Built target cutlass_library_rank_2k_sm80_c1688syr2k [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm80_d884syr2k.so [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm80_gz884her2k.so [ 99%] Built target cutlass_library_rank_2k_sm80_c1688tf32her2k [ 99%] Built target cutlass_library_rank_2k_sm80_c1688tf32syr2k [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm80_gz884syr2k.so [ 99%] Built target cutlass_library_rank_2k_sm80_d884syr2k [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm80_s1688syr2k.so [ 99%] Built target cutlass_library_rank_2k_sm80_gz884her2k [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm80_s1688tf32syr2k.so [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm80_z884her2k.so [ 99%] Built target cutlass_library_rank_2k_sm80_gz884syr2k [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm80_z884syr2k.so [ 99%] Built target cutlass_library_rank_2k_sm80_s1688syr2k [ 99%] Built target cutlass_library_rank_2k_sm80_s1688tf32syr2k [ 99%] Built target cutlass_library_rank_2k_sm80_z884her2k [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm90_d1684syr2k.so [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm90_gz1684her2k.so [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm90_gz1684syr2k.so [ 99%] Built target cutlass_library_rank_2k_sm80_z884syr2k [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm90_z1684her2k.so [ 99%] Built target cutlass_library_rank_2k_sm90_d1684syr2k [ 99%] Built target cutlass_library_rank_2k_sm90_gz1684her2k [ 99%] Built target cutlass_library_rank_2k_sm90_gz1684syr2k [ 99%] Linking CUDA shared library libcutlass_rank_2k_sm90_z1684syr2k.so [ 99%] Linking CUDA shared library libcutlass_trmm_sm80_c1688tf32trmm.so [ 99%] Linking CUDA shared library libcutlass_trmm_sm80_c1688trmm.so [ 99%] Built target cutlass_library_rank_2k_sm90_z1684her2k [ 99%] Linking CUDA shared library libcutlass_trmm_sm80_d884trmm.so [ 99%] Built target cutlass_library_rank_2k_sm90_z1684syr2k [ 99%] Linking CUDA shared library libcutlass_trmm_sm80_gz884trmm.so [ 99%] Built target cutlass_library_trmm_sm80_c1688tf32trmm [ 99%] Built target cutlass_library_trmm_sm80_c1688trmm [ 99%] Linking CUDA shared library libcutlass_trmm_sm80_s1688tf32trmm.so [ 99%] Linking CUDA shared library libcutlass_trmm_sm80_s1688trmm.so [ 99%] Built target cutlass_library_trmm_sm80_d884trmm [ 99%] Linking CUDA shared library libcutlass_trmm_sm80_z884trmm.so [ 99%] Built target cutlass_library_trmm_sm80_gz884trmm [ 99%] Linking CUDA shared library libcutlass_trmm_sm90_d1684trmm.so [ 99%] Built target cutlass_library_trmm_sm80_s1688tf32trmm [ 99%] Built target cutlass_library_trmm_sm80_s1688trmm [ 99%] Linking CUDA shared library libcutlass_trmm_sm90_gz1684trmm.so [ 99%] Linking CUDA shared library libcutlass_trmm_sm90_z1684trmm.so [ 99%] Built target cutlass_library_trmm_sm80_z884trmm [ 99%] Linking CUDA shared library libcutlass_symm_sm80_c1688hemm.so [ 99%] Built target cutlass_library_trmm_sm90_d1684trmm [ 99%] Linking CUDA shared library libcutlass_symm_sm80_c1688symm.so [ 99%] Built target cutlass_library_trmm_sm90_gz1684trmm [ 99%] Built target cutlass_library_trmm_sm90_z1684trmm [ 99%] Linking CUDA shared library libcutlass_symm_sm80_c1688tf32hemm.so [ 99%] Linking CUDA shared library libcutlass_symm_sm80_c1688tf32symm.so [ 99%] Built target cutlass_library_symm_sm80_c1688hemm [ 99%] Linking CUDA shared library libcutlass_symm_sm80_d884symm.so [ 99%] Built target cutlass_library_symm_sm80_c1688symm [ 99%] Linking CUDA shared library libcutlass_symm_sm80_gz884hemm.so [ 99%] Built target cutlass_library_symm_sm80_c1688tf32hemm [ 99%] Built target cutlass_library_symm_sm80_c1688tf32symm [ 99%] Linking CUDA shared library libcutlass_symm_sm80_gz884symm.so [ 99%] Linking CUDA shared library libcutlass_symm_sm80_s1688symm.so [ 99%] Built target cutlass_library_symm_sm80_d884symm [ 99%] Linking CUDA shared library libcutlass_symm_sm80_s1688tf32symm.so [ 99%] Built target cutlass_library_symm_sm80_gz884hemm [ 99%] Linking CUDA shared library libcutlass_symm_sm80_z884hemm.so [ 99%] Built target cutlass_library_symm_sm80_gz884symm [ 99%] Built target cutlass_library_symm_sm80_s1688symm [ 99%] Linking CUDA shared library libcutlass_symm_sm80_z884symm.so [ 99%] Linking CUDA shared library libcutlass_symm_sm90_d1684symm.so [ 99%] Built target cutlass_library_symm_sm80_s1688tf32symm [ 99%] Linking CUDA shared library libcutlass_symm_sm90_gz1684hemm.so [ 99%] Built target cutlass_library_symm_sm80_z884hemm [ 99%] Linking CUDA shared library libcutlass_symm_sm90_gz1684symm.so [ 99%] Built target cutlass_library_symm_sm80_z884symm [ 99%] Built target cutlass_library_symm_sm90_d1684symm [ 99%] Linking CUDA shared library libcutlass_symm_sm90_z1684hemm.so [ 99%] Built target cutlass_library_symm_sm90_gz1684hemm [ 99%] Linking CXX static library libcutlass.a [ 99%] Built target cutlass_library_symm_sm90_gz1684symm [ 99%] Built target cutlass_library_symm_sm90_z1684hemm [ 99%] Linking CXX shared library libcutlass.so [ 99%] Built target cutlass_library_static [ 99%] Built target cutlass_library [ 99%] Building CUDA object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/cutlass_profiler.cu.o [ 99%] Building CXX object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/main.cpp.o [ 99%] Building CXX object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/performance_report.cpp.o [ 99%] Building CUDA object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/options.cu.o [ 99%] Building CXX object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/enumerated_types.cpp.o [ 99%] Building CXX object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/gpu_timer.cpp.o [ 99%] Building CUDA object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/device_allocation.cu.o [ 99%] Building CUDA object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/device_context.cu.o /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::int2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::int2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::int2b_t]" at line 1065 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::int4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::int4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::int4b_t]" at line 1073 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint1b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint1b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint1b_t]" at line 1113 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint2b_t]" at line 1121 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint4b_t]" at line 1129 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/profiler/src/options.cu: In constructor ‘cutlass::profiler::Options::Device::Device(const cutlass::CommandLine&)’: /builddir/build/BUILD/cutlass/tools/profiler/src/options.cu:126:35: warning: conversion from ‘size_t’ {aka ‘long unsigned int’} to ‘int’ may change value [-Wconversion] 126 | int cc = compute_capability(device_index); | ^~~~~~~~~~~~ [ 99%] Building CUDA object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/cublas_helpers.cu.o [ 99%] Building CXX object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/cudnn_helpers.cpp.o [ 99%] Building CXX object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/problem_space.cpp.o [ 99%] Building CUDA object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/operation_profiler.cu.o [ 99%] Building CUDA object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/gemm_operation_profiler.cu.o [ 99%] Building CUDA object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/rank_k_operation_profiler.cu.o /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::int2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::int2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::int2b_t]" at line 1065 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::int4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::int4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::int4b_t]" at line 1073 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint1b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint1b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint1b_t]" at line 1113 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint2b_t]" at line 1121 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint4b_t]" at line 1129 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu [ 99%] Building CUDA object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/rank_2k_operation_profiler.cu.o [ 99%] Building CUDA object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/trmm_operation_profiler.cu.o [ 99%] Building CUDA object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/symm_operation_profiler.cu.o /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::int2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::int2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::int2b_t]" at line 1065 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::int4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::int4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::int4b_t]" at line 1073 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint1b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint1b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint1b_t]" at line 1113 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint2b_t]" at line 1121 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint4b_t]" at line 1129 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu [ 99%] Building CUDA object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/conv2d_operation_profiler.cu.o [ 99%] Building CUDA object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/conv3d_operation_profiler.cu.o [ 99%] Building CUDA object tools/profiler/CMakeFiles/cutlass_profiler.dir/src/sparse_gemm_operation_profiler.cu.o /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::int2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::int2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::int2b_t]" at line 1065 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::int4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::int4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::int4b_t]" at line 1073 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint1b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint1b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint1b_t]" at line 1113 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint2b_t]" at line 1121 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint4b_t]" at line 1129 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::int2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::int2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::int2b_t]" at line 1065 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::int4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::int4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::int4b_t]" at line 1073 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint1b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint1b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint1b_t]" at line 1113 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint2b_t]" at line 1121 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint4b_t]" at line 1129 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu Remark: The warnings can be suppressed with "-diag-suppress " /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int2b_t]" at line 617 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::int4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::int4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::int4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::int4b_t]" at line 625 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint1b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint1b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint1b_t]" at line 665 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint2b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint2b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint2b_t]" at line 673 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(182): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(185): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(191): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomGaussianFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomGaussianFunc]" at line 423 instantiation of "void cutlass::reference::device::BlockFillRandomGaussian(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1844 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(542): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(545): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=double, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") if (params.exclude_zero >=0 && result == Element(0.0)) { ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(551): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") result = Element(rnd); ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "Element cutlass::reference::device::detail::RandomUniformFunc::operator()() [with Element=cutlass::uint4b_t]" at line 149 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::BlockForEach(Element *, size_t, Func::Params) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 122 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::BlockForEach::BlockForEach(Element *, size_t, Func::Params, int, int, cudaStream_t) [with Element=cutlass::uint4b_t, Func=cutlass::reference::device::detail::RandomUniformFunc]" at line 822 instantiation of "void cutlass::reference::device::BlockFillRandomUniform(Element *, size_t, uint64_t, cutlass::RealType::Type, cutlass::RealType::Type, int, double, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 1855 instantiation of "void cutlass::reference::device::BlockFillRandom(Element *, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element=cutlass::uint4b_t]" at line 681 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::int2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::int2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::int2b_t]" at line 1065 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=true, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::int4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::int4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::int4b_t]" at line 1073 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=1, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint1b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint1b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint1b_t]" at line 1113 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=2, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint2b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint2b_t]" at line 1121 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h(1717): warning #1444-D: function "cutlass::integer_subbyte::integer_subbyte(T) [with Bits=4, Signed=false, T=float, Enable=void]" was declared deprecated ("Implicit conversion is deprecated; please use explicit construction instead") sum = Element(static_cast(sum) + ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h(77): note #3287-D: because of a "deprecated" attribute [[deprecated("Implicit conversion is deprecated; please use explicit construction instead")]] ^ detected during: instantiation of "void cutlass::reference::device::detail::TensorFillLinearFunc::operator()(const cutlass::reference::device::detail::TensorFillLinearFunc::TensorCoord &) [with Element=cutlass::uint4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 82 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "cutlass::reference::device::kernel::detail::TensorForEachHelper::TensorForEachHelper(Func &, const cutlass::Coord &, cutlass::Coord &, int64_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1]" at line 109 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/kernel/tensor_foreach.h instantiation of "void cutlass::reference::device::kernel::TensorForEach(cutlass::Coord, Params) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 59 of /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_foreach.h instantiation of "cutlass::reference::device::TensorForEach::TensorForEach(cutlass::Coord, Params, int, int, cudaStream_t) [with Func=cutlass::reference::device::detail::TensorFillLinearFunc, Rank=1, Params=cutlass::reference::device::detail::TensorFillLinearFunc::Params]" at line 1752 instantiation of "void cutlass::reference::device::TensorFillLinear(cutlass::TensorView, const cutlass::Array> &, Element, cudaStream_t) [with Element=cutlass::uint4b_t, Layout=cutlass::layout::PackedVectorLayout]" at line 1817 instantiation of "void cutlass::reference::device::BlockFillSequential(Element *, int64_t, Element, Element) [with Element=cutlass::uint4b_t]" at line 1129 of /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu: In member function ‘void cutlass::profiler::DeviceAllocation::initialize_sequential_device(cutlass::Distribution)’: /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1065:175: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1065 | cutlass::reference::device::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1065:223: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1065 | cutlass::reference::device::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1073:175: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1073 | cutlass::reference::device::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1073:223: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1073 | cutlass::reference::device::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1113:178: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1113 | cutlass::reference::device::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1113:227: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1113 | cutlass::reference::device::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1121:178: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1121 | cutlass::reference::device::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1121:227: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1121 | cutlass::reference::device::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1129:178: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1129 | cutlass::reference::device::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1129:227: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1129 | cutlass::reference::device::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu: In member function ‘void cutlass::profiler::DeviceAllocation::initialize_sequential_host(cutlass::Distribution)’: /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1295:181: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1295 | cutlass::reference::host::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1295:229: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1295 | cutlass::reference::host::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1303:181: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1303 | cutlass::reference::host::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1303:229: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1303 | cutlass::reference::host::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1343:184: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1343 | cutlass::reference::host::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1343:233: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1343 | cutlass::reference::host::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1351:184: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1351 | cutlass::reference::host::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1351:233: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1351 | cutlass::reference::host::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1359:184: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1359 | cutlass::reference::host::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1359:233: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1359 | cutlass::reference::host::BlockFillSequential( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu: In static member function ‘static bool cutlass::profiler::DeviceAllocation::block_compare_relatively_equal(cutlass::library::NumericTypeID, const void*, const void*, size_t, double, double)’: /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1709:210: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1709 | return reference::device::BlockCompareRelativelyEqual( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1709:248: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1709 | return reference::device::BlockCompareRelativelyEqual( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1717:210: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1717 | return reference::device::BlockCompareRelativelyEqual( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1717:248: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1717 | return reference::device::BlockCompareRelativelyEqual( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1757:214: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1757 | return reference::device::BlockCompareRelativelyEqual( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1757:253: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1757 | return reference::device::BlockCompareRelativelyEqual( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1765:214: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1765 | return reference::device::BlockCompareRelativelyEqual( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1765:253: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1765 | return reference::device::BlockCompareRelativelyEqual( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1773:214: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1773 | return reference::device::BlockCompareRelativelyEqual( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:1773:253: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1773 | return reference::device::BlockCompareRelativelyEqual( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu: In member function ‘void cutlass::profiler::DeviceAllocation::fill_device(double)’: /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:2198:75: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 2198 | tensor_fill(*this, static_cast(val)); | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:2202:75: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 2202 | tensor_fill(*this, static_cast(val)); | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:2222:77: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 2222 | tensor_fill(*this, static_cast(val)); | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:2226:77: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 2226 | tensor_fill(*this, static_cast(val)); | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:2230:77: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 2230 | tensor_fill(*this, static_cast(val)); | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu: In member function ‘void cutlass::profiler::DeviceAllocation::fill_host(double)’: /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:2329:151: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 2329 | cutlass::reference::host::BlockFill( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:2337:151: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 2337 | cutlass::reference::host::BlockFill( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:2377:154: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 2377 | cutlass::reference::host::BlockFill( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:2385:154: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 2385 | cutlass::reference::host::BlockFill( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:2393:154: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 2393 | cutlass::reference::host::BlockFill( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h: In instantiation of ‘void cutlass::reference::device::BlockFillRandom(Element*, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element = cutlass::integer_subbyte<2, true>; size_t = long unsigned int; uint64_t = long unsigned int; cudaStream_t = CUstream_st*]’: /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:617:74: required from here /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1837:57: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1837 | BlockFillRandomGaussian( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1837:99: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1837 | BlockFillRandomGaussian( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1847:56: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1847 | BlockFillRandomUniform( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1847:96: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1847 | BlockFillRandomUniform( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h: In instantiation of ‘void cutlass::reference::device::BlockFillRandom(Element*, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element = cutlass::integer_subbyte<4, true>; size_t = long unsigned int; uint64_t = long unsigned int; cudaStream_t = CUstream_st*]’: /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:625:74: required from here /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1837:57: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1837 | BlockFillRandomGaussian( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1837:99: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1837 | BlockFillRandomGaussian( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1847:56: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1847 | BlockFillRandomUniform( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1847:96: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1847 | BlockFillRandomUniform( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h: In instantiation of ‘void cutlass::reference::device::BlockFillRandom(Element*, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element = cutlass::integer_subbyte<1, false>; size_t = long unsigned int; uint64_t = long unsigned int; cudaStream_t = CUstream_st*]’: /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:665:75: required from here /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1837:57: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1837 | BlockFillRandomGaussian( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1837:99: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1837 | BlockFillRandomGaussian( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1847:56: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1847 | BlockFillRandomUniform( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1847:96: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1847 | BlockFillRandomUniform( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h: In instantiation of ‘void cutlass::reference::device::BlockFillRandom(Element*, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element = cutlass::integer_subbyte<2, false>; size_t = long unsigned int; uint64_t = long unsigned int; cudaStream_t = CUstream_st*]’: /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:673:75: required from here /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1837:57: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1837 | BlockFillRandomGaussian( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1837:99: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1837 | BlockFillRandomGaussian( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1847:56: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1847 | BlockFillRandomUniform( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1847:96: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1847 | BlockFillRandomUniform( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h: In instantiation of ‘void cutlass::reference::device::BlockFillRandom(Element*, size_t, uint64_t, cutlass::Distribution, cudaStream_t) [with Element = cutlass::integer_subbyte<4, false>; size_t = long unsigned int; uint64_t = long unsigned int; cudaStream_t = CUstream_st*]’: /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:681:75: required from here /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1837:57: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1837 | BlockFillRandomGaussian( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1837:99: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1837 | BlockFillRandomGaussian( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1847:56: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1847 | BlockFillRandomUniform( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/device/tensor_fill.h:1847:96: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 1847 | BlockFillRandomUniform( | ^ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h: In instantiation of ‘Element cutlass::reference::host::detail::RandomGaussianFunc::operator()() const [with Element = cutlass::integer_subbyte<2, true>]’: /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:571:55: required from ‘void cutlass::reference::host::BlockFillRandomGaussian(Element*, size_t, uint64_t, double, double, int, double) [with Element = cutlass::integer_subbyte<2, true>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:1491:35: required from ‘void cutlass::reference::host::BlockFillRandom(Element*, size_t, uint64_t, cutlass::Distribution) [with Element = cutlass::integer_subbyte<2, true>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:836:72: required from here /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:203:11: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 203 | result = static_cast(rnd); | ~^~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:206:11: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 206 | result = static_cast(rnd); | ~^~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:220:11: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 220 | result = Element(rnd); | ~^~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h: In instantiation of ‘Element cutlass::reference::host::detail::RandomUniformFunc::operator()() [with Element = cutlass::integer_subbyte<2, true>]’: /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:1123:55: required from ‘void cutlass::reference::host::BlockFillRandomUniform(Element*, size_t, uint64_t, double, double, int, double) [with Element = cutlass::integer_subbyte<2, true>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:1501:34: required from ‘void cutlass::reference::host::BlockFillRandom(Element*, size_t, uint64_t, cutlass::Distribution) [with Element = cutlass::integer_subbyte<2, true>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:836:72: required from here /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:642:33: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 642 | result = static_cast(Real(rnd)); | ^~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:645:33: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 645 | result = static_cast(Real(rnd)); | ^~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:654:33: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 654 | result = static_cast(Real(rnd)); | ^~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h: In instantiation of ‘Element cutlass::reference::host::detail::RandomGaussianFunc::operator()() const [with Element = cutlass::integer_subbyte<4, true>]’: /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:571:55: required from ‘void cutlass::reference::host::BlockFillRandomGaussian(Element*, size_t, uint64_t, double, double, int, double) [with Element = cutlass::integer_subbyte<4, true>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:1491:35: required from ‘void cutlass::reference::host::BlockFillRandom(Element*, size_t, uint64_t, cutlass::Distribution) [with Element = cutlass::integer_subbyte<4, true>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:844:72: required from here /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:203:11: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 203 | result = static_cast(rnd); | ~^~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:206:11: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 206 | result = static_cast(rnd); | ~^~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:220:11: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 220 | result = Element(rnd); | ~^~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h: In instantiation of ‘Element cutlass::reference::host::detail::RandomUniformFunc::operator()() [with Element = cutlass::integer_subbyte<4, true>]’: /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:1123:55: required from ‘void cutlass::reference::host::BlockFillRandomUniform(Element*, size_t, uint64_t, double, double, int, double) [with Element = cutlass::integer_subbyte<4, true>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:1501:34: required from ‘void cutlass::reference::host::BlockFillRandom(Element*, size_t, uint64_t, cutlass::Distribution) [with Element = cutlass::integer_subbyte<4, true>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:844:72: required from here /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:642:33: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 642 | result = static_cast(Real(rnd)); | ^~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:645:33: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 645 | result = static_cast(Real(rnd)); | ^~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:654:33: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = true]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 654 | result = static_cast(Real(rnd)); | ^~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h: In instantiation of ‘Element cutlass::reference::host::detail::RandomGaussianFunc::operator()() const [with Element = cutlass::integer_subbyte<1, false>]’: /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:571:55: required from ‘void cutlass::reference::host::BlockFillRandomGaussian(Element*, size_t, uint64_t, double, double, int, double) [with Element = cutlass::integer_subbyte<1, false>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:1491:35: required from ‘void cutlass::reference::host::BlockFillRandom(Element*, size_t, uint64_t, cutlass::Distribution) [with Element = cutlass::integer_subbyte<1, false>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:884:73: required from here /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:203:11: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 203 | result = static_cast(rnd); | ~^~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:206:11: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 206 | result = static_cast(rnd); | ~^~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:220:11: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 220 | result = Element(rnd); | ~^~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h: In instantiation of ‘Element cutlass::reference::host::detail::RandomUniformFunc::operator()() [with Element = cutlass::integer_subbyte<1, false>]’: /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:1123:55: required from ‘void cutlass::reference::host::BlockFillRandomUniform(Element*, size_t, uint64_t, double, double, int, double) [with Element = cutlass::integer_subbyte<1, false>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:1501:34: required from ‘void cutlass::reference::host::BlockFillRandom(Element*, size_t, uint64_t, cutlass::Distribution) [with Element = cutlass::integer_subbyte<1, false>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:884:73: required from here /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:642:33: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 642 | result = static_cast(Real(rnd)); | ^~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:645:33: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 645 | result = static_cast(Real(rnd)); | ^~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:654:33: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 1; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 654 | result = static_cast(Real(rnd)); | ^~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h: In instantiation of ‘Element cutlass::reference::host::detail::RandomGaussianFunc::operator()() const [with Element = cutlass::integer_subbyte<2, false>]’: /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:571:55: required from ‘void cutlass::reference::host::BlockFillRandomGaussian(Element*, size_t, uint64_t, double, double, int, double) [with Element = cutlass::integer_subbyte<2, false>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:1491:35: required from ‘void cutlass::reference::host::BlockFillRandom(Element*, size_t, uint64_t, cutlass::Distribution) [with Element = cutlass::integer_subbyte<2, false>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:892:73: required from here /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:203:11: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 203 | result = static_cast(rnd); | ~^~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:206:11: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 206 | result = static_cast(rnd); | ~^~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:220:11: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 220 | result = Element(rnd); | ~^~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h: In instantiation of ‘Element cutlass::reference::host::detail::RandomUniformFunc::operator()() [with Element = cutlass::integer_subbyte<2, false>]’: /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:1123:55: required from ‘void cutlass::reference::host::BlockFillRandomUniform(Element*, size_t, uint64_t, double, double, int, double) [with Element = cutlass::integer_subbyte<2, false>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:1501:34: required from ‘void cutlass::reference::host::BlockFillRandom(Element*, size_t, uint64_t, cutlass::Distribution) [with Element = cutlass::integer_subbyte<2, false>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:892:73: required from here /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:642:33: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 642 | result = static_cast(Real(rnd)); | ^~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:645:33: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 645 | result = static_cast(Real(rnd)); | ^~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:654:33: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 2; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 654 | result = static_cast(Real(rnd)); | ^~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h: In instantiation of ‘Element cutlass::reference::host::detail::RandomGaussianFunc::operator()() const [with Element = cutlass::integer_subbyte<4, false>]’: /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:571:55: required from ‘void cutlass::reference::host::BlockFillRandomGaussian(Element*, size_t, uint64_t, double, double, int, double) [with Element = cutlass::integer_subbyte<4, false>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:1491:35: required from ‘void cutlass::reference::host::BlockFillRandom(Element*, size_t, uint64_t, cutlass::Distribution) [with Element = cutlass::integer_subbyte<4, false>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:900:73: required from here /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:203:11: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 203 | result = static_cast(rnd); | ~^~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:206:11: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 206 | result = static_cast(rnd); | ~^~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:220:11: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 220 | result = Element(rnd); | ~^~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h: In instantiation of ‘Element cutlass::reference::host::detail::RandomUniformFunc::operator()() [with Element = cutlass::integer_subbyte<4, false>]’: /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:1123:55: required from ‘void cutlass::reference::host::BlockFillRandomUniform(Element*, size_t, uint64_t, double, double, int, double) [with Element = cutlass::integer_subbyte<4, false>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:1501:34: required from ‘void cutlass::reference::host::BlockFillRandom(Element*, size_t, uint64_t, cutlass::Distribution) [with Element = cutlass::integer_subbyte<4, false>; size_t = long unsigned int; uint64_t = long unsigned int]’ /builddir/build/BUILD/cutlass/tools/profiler/src/device_allocation.cu:900:73: required from here /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:642:33: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 642 | result = static_cast(Real(rnd)); | ^~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:645:33: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 645 | result = static_cast(Real(rnd)); | ^~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/cutlass/tools/util/include/cutlass/util/reference/host/tensor_fill.h:654:33: warning: ‘cutlass::integer_subbyte::integer_subbyte(T) [with T = double; Enable = void; int Bits = 4; bool Signed = false]’ is deprecated: Implicit conversion is deprecated; please use explicit construction instead [-Wdeprecated-declarations] 654 | result = static_cast(Real(rnd)); | ^~~~~~~~~ /builddir/build/BUILD/cutlass/include/cutlass/integer_subbyte.h:79:1: note: declared here 79 | integer_subbyte(T value) | ^ ~~~~~~~~~~~~~ [100%] Linking CXX executable cutlass_profiler [100%] Built target cutlass_profiler + popd ~/build/BUILD/cutlass + RPM_EC=0 ++ jobs -p + exit 0 Executing(%install): /bin/sh -e /var/tmp/rpm-tmp.DFoLIT + umask 022 + cd /builddir/build/BUILD + '[' /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64 '!=' / ']' + rm -rf /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64 ++ dirname /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64 + mkdir -p /builddir/build/BUILDROOT + mkdir /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64 + CFLAGS=' ' + export CFLAGS + CXXFLAGS=' ' + export CXXFLAGS + FFLAGS=' -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS=' -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=gcc + export CC + CXX=g++ + export CXX + cd cutlass + rm -rf /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64 + pushd build ~/build/BUILD/cutlass/build ~/build/BUILD/cutlass + DESTDIR=/builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64 + /usr/bin/cmake --install . -- Install configuration: "Release" -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/algorithm -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/algorithm/axpby.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/algorithm/clear.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/algorithm/cooperative_copy.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/algorithm/cooperative_gemm.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/algorithm/copy.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/algorithm/fill.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/algorithm/functional.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/algorithm/gemm.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/algorithm/prefer.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/algorithm/prefetch.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/algorithm/tensor_algorithms.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/algorithm/tuple_algorithms.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/cluster_sm90.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/config.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/copy.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/copy_sm50.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/copy_sm75.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/copy_sm80.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/copy_sm90.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/copy_sm90_desc.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/copy_sm90_tma.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/mma.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/mma_sm61.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/mma_sm70.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/mma_sm75.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/mma_sm80.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/mma_sm90.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/mma_sm90_desc.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/mma_sm90_gmma.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/mma_sm90_gmma_sparse.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/arch/util.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/copy_atom.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/copy_traits.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/copy_traits_sm50.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/copy_traits_sm75.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/copy_traits_sm80.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/copy_traits_sm90.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/copy_traits_sm90_im2col.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/copy_traits_sm90_tma.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/copy_traits_sm90_tma_swizzle.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/mma_atom.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/mma_traits.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/mma_traits_sm61.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/mma_traits_sm70.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/mma_traits_sm75.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/mma_traits_sm80.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/mma_traits_sm90.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/mma_traits_sm90_gmma.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/atom/mma_traits_sm90_gmma_sparse.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/config.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/container -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/container/alignment.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/container/array.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/container/array_aligned.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/container/array_subbyte.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/container/bit_field.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/container/cuda_types.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/container/packed_tuple.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/container/tuple.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/container/type_list.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/int_tuple.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/layout.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/layout_composed.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/numeric -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/numeric/arithmetic_tuple.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/numeric/complex.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/numeric/int.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/numeric/integer_sequence.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/numeric/integral_constant.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/numeric/integral_ratio.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/numeric/math.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/numeric/numeric_types.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/numeric/real.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/pointer.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/pointer_base.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/pointer_flagged.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/pointer_sparse.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/pointer_swizzle.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/stride.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/swizzle.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/swizzle_layout.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/tensor.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/tensor_impl.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/tensor_predicate.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/tensor_zip.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/underscore.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/util -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/util/debug.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/util/print.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cute/util/type_traits.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/aligned_buffer.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/arch.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/barrier.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/cache_operation.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/config.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/grid_dependency_control.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/memory.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/memory_sm75.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/memory_sm80.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/mma.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/mma_sm50.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/mma_sm60.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/mma_sm61.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/mma_sm70.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/mma_sm75.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/mma_sm80.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/mma_sm89.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/mma_sm90.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/mma_sparse_sm80.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/mma_sparse_sm89.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/reg_reconfig.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/simd.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/simd_sm60.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/simd_sm61.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/synclog.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/wmma.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/wmma_sm70.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/wmma_sm72.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/arch/wmma_sm75.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/array.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/array_planar_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/array_subbyte.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/barrier.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/bfloat16.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/blas3.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/blas3_types.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/block_striped.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/cluster_launch.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/constants.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/collective -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/collective/builders -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/collective/builders/sm90_common.inl -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/collective/builders/sm90_gmma_builder.inl -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/collective/collective_builder.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/collective/collective_conv.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/collective/detail.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/collective/sm90_implicit_gemm_gmma_ss_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/conv2d_problem_size.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/conv3d_problem_size.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/convnd_problem_shape.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/convolution.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/detail.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/device -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/device/conv_universal_adapter.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/device/direct_convolution.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/device/implicit_gemm_convolution.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/device/implicit_gemm_convolution_fusion.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/dispatch_policy.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/conv_universal.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_conv2d.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_conv2d_dgrad.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_conv2d_fprop.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_conv2d_fprop_fusion.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_conv2d_fprop_with_absmax.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_conv2d_fprop_with_broadcast.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_conv2d_fprop_with_reduction.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_conv2d_group_fprop.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_conv2d_wgrad.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_conv2d_wgrad_fusion.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_conv3d_dgrad.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_conv3d_fprop.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_conv3d_fprop_fusion.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_conv3d_fprop_with_broadcast.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_conv3d_wgrad.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_deconv2d.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_deconv2d_with_broadcast.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_deconv3d.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_deconv3d_with_broadcast.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/default_depthwise_fprop.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/direct_convolution.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/implicit_gemm_convolution.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/implicit_gemm_convolution_fusion.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/implicit_gemm_convolution_strided_dgrad.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/implicit_gemm_convolution_with_absmax.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/implicit_gemm_convolution_with_fused_epilogue.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/kernel/sm90_implicit_gemm_tma_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/thread -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/thread/depthwise_mma.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_analytic.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_dgrad_filter_tile_access_iterator_optimized.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_analytic.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_dgrad_output_gradient_tile_access_iterator_optimized.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_analytic.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_few_channels.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_fixed_channels.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_fprop_activation_tile_access_iterator_optimized.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_analytic.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_few_channels.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_fixed_channels.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_fprop_filter_tile_access_iterator_optimized.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_params.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_tile_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_analytic.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_wgrad_activation_tile_access_iterator_optimized.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_analytic.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv2d_wgrad_output_gradient_tile_access_iterator_optimized.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_analytic.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv3d_dgrad_filter_tile_access_iterator_optimized.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_analytic.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv3d_dgrad_output_gradient_tile_access_iterator_optimized.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_analytic.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv3d_fprop_activation_tile_access_iterator_optimized.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_analytic.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv3d_fprop_filter_tile_access_iterator_optimized.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv3d_params.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_analytic.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv3d_wgrad_activation_tile_access_iterator_optimized.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_analytic.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/conv3d_wgrad_output_gradient_tile_access_iterator_optimized.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/depthwise_direct_conv_params.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_fixed_stride_dilation.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/depthwise_fprop_activation_tile_access_iterator_direct_conv_optimized.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/depthwise_fprop_direct_conv_multistage.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/depthwise_fprop_filter_tile_access_iterator_direct_conv_optimized.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/depthwise_fprop_pipelined.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/depthwise_mma_base.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/depthwise_mma_core_with_lane_access_size.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/implicit_gemm_fprop_fusion_multistage.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/implicit_gemm_multistage.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/implicit_gemm_pipelined.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/implicit_gemm_wgrad_fusion_multistage.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/predicated_scale_bias_vector_access_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/predicated_scale_bias_vector_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/threadblock/threadblock_swizzle.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/warp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/warp/mma_depthwise_simt.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/warp/mma_depthwise_simt_tile_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/conv/warp/scale_bias_relu_transform.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/coord.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/core_io.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/cuda_host_adapter.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/cutlass.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/detail -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/detail/collective.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/detail/dependent_false.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/detail/helper_macros.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/detail/layout.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/detail/mma.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/device_kernel.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/collective -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/collective/builders -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/collective/builders/sm90_builder.inl -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/collective/builders/sm90_common.inl -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/collective/collective_builder.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/collective/collective_epilogue.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/collective/default_epilogue.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/collective/default_epilogue_array.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/collective/detail.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/collective/epilogue_tensor_broadcast.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/collective/sm70_epilogue_vectorized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/collective/sm70_epilogue_vectorized_array.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/collective/sm90_epilogue_array_tma_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/collective/sm90_epilogue_tma_warpspecialized_bias_elementwise.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/dispatch_policy.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/fusion -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/fusion/callbacks.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/fusion/operations.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/fusion/sm90_callbacks_tma_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/fusion/sm90_visitor_compute_tma_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/fusion/sm90_visitor_load_tma_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/fusion/sm90_visitor_store_tma_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/fusion/sm90_visitor_tma_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/fusion/sm90_visitor_topk_softmax.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/activation.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/conversion_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/detail.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_bias_elementwise.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_bias_relu.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_clamp.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_dgelu.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_drelu.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_gelu.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_generic.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_generic_with_scaling.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_hardswish.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_leaky_relu.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_params.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_planar_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_relu.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_relu0.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_residual_block.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_sigmoid.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_silu.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_tensor_broadcast.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/linear_combination_with_elementwise.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/reduction_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/thread/scale_type.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_epilogue_complex_tensor_op_blas3.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_epilogue_direct_store.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_epilogue_planar_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_epilogue_simt.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_epilogue_tensor_op_blas3.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_epilogue_volta_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_epilogue_with_absmax.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_epilogue_with_broadcast.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_epilogue_with_reduction.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_epilogue_wmma_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_thread_map_simt.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_thread_map_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_thread_map_volta_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/default_thread_map_wmma_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/direct_store_epilogue_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue_base.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue_base_streamk.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue_depthwise.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue_direct_store.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue_gemm_k_reduction.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue_planar_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue_smem_accumulator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue_streamk_with_broadcast.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue_visitor_with_softmax.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue_with_absmax.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue_with_broadcast.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue_with_reduction.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue_with_visitor.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue_with_visitor_callbacks.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/epilogue_workspace.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/fusion -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/fusion/visitor_2x.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/fusion/visitor_compute.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/fusion/visitor_load.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/fusion/visitor_store.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/fusion/visitors.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/interleaved_epilogue.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/output_iterator_parameter.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/output_tile_thread_map.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/predicated_tile_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/predicated_tile_iterator_affine_layout_params.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/predicated_tile_iterator_blas3.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/predicated_tile_iterator_conv.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/predicated_tile_iterator_direct_conv.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/predicated_tile_iterator_params.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/predicated_tile_iterator_predicates.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/predicated_tile_iterator_strided_dgrad.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/shared_load_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/shared_load_iterator_mixed.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/threadblock/shared_load_iterator_pitch_linear.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp/fragment_iterator_complex_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp/fragment_iterator_gaussian_complex_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp/fragment_iterator_simt.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp/fragment_iterator_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp/fragment_iterator_volta_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp/fragment_iterator_wmma_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp/simt_policy.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp/tensor_op_policy.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp/tile_iterator_simt.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp/tile_iterator_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp/tile_iterator_tensor_op_mixed.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp/tile_iterator_volta_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp/tile_iterator_wmma_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp/volta_tensor_op_policy.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/epilogue/warp/wmma_tensor_op_policy.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/fast_math.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/float8.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/floating_point_nvrtc.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/builders -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/builders/sm90_common.inl -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/builders/sm90_sparse_config.inl -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/builders/sm90_sparse_gmma_builder.inl -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/collective_builder.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/collective_builder_decl.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/collective_mma.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/collective_mma_decl.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/fp8_accumulation.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/sm70_mma_twostage.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/sm80_mma_multistage.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_rs_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/sm90_mma_multistage_gmma_ss_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized_mixed_input.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/collective/sm90_sparse_mma_tma_gmma_ss_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/base_grouped.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/default_gemm_configuration.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/ell_gemm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_array.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_batched.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_grouped.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_layernorm_mainloop_fusion.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_sparse.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_sparse_universal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_sparse_universal_with_absmax.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_sparse_with_absmax.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_sparse_with_visitor.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_splitk_parallel.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_universal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_universal_adapter.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_universal_base.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_universal_streamk_with_broadcast.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_universal_with_absmax.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_universal_with_broadcast.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemm_with_k_reduction.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/gemv.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/rank_2k.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/rank_2k_grouped.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/rank_k.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/symm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/device/trmm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/dispatch_policy.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/gemm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/gemm_enumerated_types.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/group_array_problem_shape.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_ell_gemm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_grouped.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_grouped_softmax_mainloop_fusion.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_layernorm_mainloop_fusion.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_planar_complex_universal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_sparse.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_sparse_universal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_sparse_universal_with_absmax.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_sparse_with_absmax.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_sparse_with_visitor.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_splitk_parallel.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_streamk_with_broadcast.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_universal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_universal_with_visitor.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_with_absmax.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_with_broadcast.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_with_k_reduction.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemm_with_reduction.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_gemv.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_rank_2k.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_rank_2k_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_rank_2k_grouped.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_rank_2k_universal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_rank_k.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_rank_k_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_rank_k_universal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_symm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_symm_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_symm_universal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_trmm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_trmm_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/default_trmm_universal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/ell_gemm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_array.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_batched.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_grouped.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_grouped_problem_visitor.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_grouped_softmax_mainloop_fusion.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_layernorm_mainloop_fusion.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_params.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_pipelined.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_planar_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_planar_complex_array.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_sparse_universal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_sparse_universal_with_absmax.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_splitk_parallel.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_streamk_with_fused_epilogue.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_transpose_operands.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_universal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_universal.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_universal_decl.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_universal_streamk.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_universal_with_visitor.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_universal_with_visitor_streamk.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_with_absmax.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_with_fused_epilogue.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemm_with_k_reduction.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemv.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/gemv_batched_strided.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/grouped_problem_visitor.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/params_sparse_base.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/params_universal_base.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/rank_2k_grouped.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/rank_2k_grouped_problem_visitor.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/rank_2k_transpose_operands.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/rank_2k_universal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/rank_k_universal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sm70_gemm.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sm90_gemm_tma.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_cooperative.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sm90_gemm_tma_warpspecialized_pingpong.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_cooperative.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sm90_gemm_warpspecialized_pingpong.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sm90_tile_scheduler.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sm90_tile_scheduler_group.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sm90_tile_scheduler_stream_k.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sparse_gemm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sparse_gemm_with_absmax.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/sparse_gemm_with_visitor.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/static_tile_scheduler.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/symm_universal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/tile_scheduler.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/tile_scheduler_params.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/kernel/trmm_universal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/thread -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/thread/mma.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/thread/mma_sm50.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/thread/mma_sm60.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/thread/mma_sm61.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_ell_mma.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_gemv_core.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_mma.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_mma_core.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_mma_core_simt.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_mma_core_sm70.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_mma_core_sm75.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_mma_core_sm80.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_mma_core_sparse_sm80.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_mma_core_with_access_size.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_mma_core_with_reduction.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_mma_core_wmma.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_mma_layernorm_mainloop_fusion.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_mma_planar_complex_multistage.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_mma_planar_complex_pipelined.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_mma_softmax_mainloop_fusion.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_mma_with_reduction.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_multistage_mma_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_multistage_mma_complex_core_sm80.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_multistage_trmm_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_sparse_mma.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/default_trmm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/ell_mma_multistage.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/ell_mma_pipelined.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/gemv.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/index_remat.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/mma_base.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/mma_blas3_multistage.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/mma_layernorm_mainloop_fusion_multistage.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/mma_multistage.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/mma_pipelined.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/mma_planar_complex_base.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/mma_planar_complex_multistage.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/mma_planar_complex_pipelined.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/mma_singlestage.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/mma_softmax_mainloop_fusion_multistage.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/mma_sparse_base.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/mma_sparse_multistage.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/mma_with_reduction_multistage.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/threadblock_swizzle.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/threadblock/threadblock_swizzle_streamk.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/default_mma_complex_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/default_mma_sparse_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/default_mma_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/default_mma_tensor_op_sm80.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/default_mma_with_reduction_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/default_mma_wmma_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/layernorm_scale_bias_transform.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_complex_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_complex_tensor_op_fast_f32.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_complex_tensor_op_tile_iterator_sm80.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_gaussian_complex_tensor_op_tile_iterator_sm80.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_mixed_input_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_planar_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_simt.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_simt_policy.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_simt_tile_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_sparse_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_tensor_op_fast_f32.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_tensor_op_fragment_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_tensor_op_policy.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_tensor_op_sm70.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_tensor_op_tile_access_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm70.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sm80.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_sparse.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_tensor_op_tile_iterator_wmma.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_tensor_op_wmma.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/mma_with_reduction_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/scale_bias_tile_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/softmax_scale_bias_transform.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm/warp/tile_iterator_planar_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm_coord.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/gemm_coord.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/half.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/integer_subbyte.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/kernel_hardware_info.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/kernel_hardware_info.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/kernel_launch.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/layout -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/layout/layout.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/layout/matrix.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/layout/permute.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/layout/pitch_linear.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/layout/tensor.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/layout/tensor_op_multiplicand_sm70.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/layout/tensor_op_multiplicand_sm75.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/layout/tensor_op_multiplicand_sm80.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/layout/vector.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/matrix.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/matrix_coord.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/matrix_shape.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/numeric_conversion.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/numeric_size.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/numeric_types.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/pipeline -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/pipeline/pipeline.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/pipeline/sm90_pipeline.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/pitch_linear_coord.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/platform -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/platform/platform.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/predicate_vector.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/quaternion.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/real.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/reduction -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/reduction/device -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/reduction/device/reduce_split_k.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/reduction/device/tensor_reduce.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/reduction/device/tensor_reduce_affine_contiguous.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/reduction/device/tensor_reduce_affine_strided.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/reduction/kernel -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/reduction/kernel/reduce_softmax_final.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/reduction/kernel/reduce_split_k.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/reduction/kernel/tensor_reduce_affine_contiguous.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/reduction/kernel/tensor_reduce_affine_strided.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/reduction/thread -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/reduction/thread/reduce.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/reduction/thread/reduction_operators.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/reduction/threadblock_swizzle.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/relatively_equal.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/semaphore.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/subbyte_reference.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/tensor_coord.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/tensor_ref.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/tensor_ref_planar_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/tensor_view.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/tensor_view_planar_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/tfloat32.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/thread -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/thread/matrix.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/trace.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/collective -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/collective/sm90_wgmma_transpose.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/device -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/device/transform_universal_adapter.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/kernel -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/kernel/filter_format_transformer.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/kernel/sparse_gemm_compressor.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/pitch_linear_thread_map.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/thread -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/thread/transpose.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/thread/unary_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/ell_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/ell_predicated_tile_access_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/ell_predicated_tile_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/predicated_scale_bias_vector_access_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/predicated_scale_bias_vector_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/predicated_tile_access_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/predicated_tile_access_iterator_params.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/predicated_tile_access_iterator_triangular_matrix.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/predicated_tile_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/predicated_tile_iterator_2dthreadtile.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/predicated_tile_iterator_triangular_matrix.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/predicated_vector_access_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/regular_scale_bias_vector_access_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/regular_tile_access_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/regular_tile_access_iterator_pitch_linear_direct_conv.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/regular_tile_access_iterator_tensor_op_sm80.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/regular_tile_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/regular_tile_iterator_pitch_linear_2dthreadtile.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/regular_tile_iterator_tensor_op_sm70.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/threadblock/vector_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/warp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/transform/warp/vector_fragment_iterator.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/uint128.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/version.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/wmma_array.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/workspace.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/functional.h.fp16~ -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/functional.h -- Up-to-date: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include -- Up-to-date: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/cutlass/version_extended.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/test/cutlass -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/test/cutlass/bin -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/test/cutlass/lib64 -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/test/cutlass/ctest -- Up-to-date: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/ -- Up-to-date: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/GPU_Clock.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/command_line.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/cublas_wrappers.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/debug.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/device_dump.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/device_groupnorm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/device_layernorm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/device_memory.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/device_nchw_to_nhwc.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/device_nhwc_padding.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/device_nhwc_pooling.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/device_nhwc_to_nchw.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/device_rmsnorm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/device_utils.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/distribution.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/exceptions.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/gett_commandline.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/helper_cuda.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/host_reorder.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/host_tensor.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/host_tensor_planar_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/host_uncompress.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/index_sequence.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/packed_stride.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/print_error.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/detail -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/detail/inner_product.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/detail/linear_to_coordinate.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/convolution.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/gemm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/gemm_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/gemm_planar_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/gett.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/kernel -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/kernel/gemm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/kernel/tensor_elementwise.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/kernel/tensor_foreach.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/rank_2k_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/tensor_compare.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/tensor_fill.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/tensor_foreach.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/tensor_reduce.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/tensor_relu.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/thread -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/device/thread/gemm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/conv.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/convolution.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/error_metrics.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/gemm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/gemm_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/gemm_planar_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/gett.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/rank_2k.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/rank_2k_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/rank_k_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/symm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/symm_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/tensor_compare.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/tensor_compare.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/tensor_copy.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/tensor_elementwise.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/tensor_fill.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/tensor_fill.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/tensor_foreach.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/tensor_norm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/tensor_reduce.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/tensor_reduce.hpp -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/trmm.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/reference/host/trmm_complex.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/tensor_view_io.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/util/type_traits.h -- Up-to-date: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include/ -- Up-to-date: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/library -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/library/arch_mappings.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/library/descriptions.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/library/handle.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/library/library.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/library/manifest.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/library/operation_table.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/library/singleton.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/library/types.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/include//cutlass/library/util.h -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm50_cgemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm50_cgemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm50_dgemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm50_dgemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm50_sgemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm50_sgemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm60_hgemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm60_hgemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm61_igemm_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm61_igemm_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm61_s8_igemm_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm61_s8_igemm_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_f16_s884gemm_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_f16_s884gemm_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_f16_s884gemm_planar_complex_array_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_f16_s884gemm_planar_complex_array_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_f16_s884gemm_planar_complex_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_f16_s884gemm_planar_complex_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_h884gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_h884gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_h884gemm_planar_complex.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_h884gemm_planar_complex.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_h884gemm_planar_complex_array.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_h884gemm_planar_complex_array.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_s884gemm_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_s884gemm_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_s884gemm_planar_complex_array_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_s884gemm_planar_complex_array_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_s884gemm_planar_complex_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_s884gemm_planar_complex_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_f16_s1688gemm_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_f16_s1688gemm_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_f16_s1688gemm_planar_complex_array_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_f16_s1688gemm_planar_complex_array_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_f16_s1688gemm_planar_complex_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_f16_s1688gemm_planar_complex_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_h1688gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_h1688gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_h1688gemm_planar_complex.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_h1688gemm_planar_complex.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_h1688gemm_planar_complex_array.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_h1688gemm_planar_complex_array.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_i88128xorgemm_b1.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_i88128xorgemm_b1.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_i8816gemm_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_i8816gemm_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_i8816gemm_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_i8816gemm_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_i8832gemm_s4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_i8832gemm_s4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_i8832gemm_u4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_i8832gemm_u4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_s1688gemm_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_s1688gemm_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_s1688gemm_planar_complex_array_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_s1688gemm_planar_complex_array_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_s1688gemm_planar_complex_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_s1688gemm_planar_complex_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_s4_i8832gemm_s4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_s4_i8832gemm_s4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_s8_i8816gemm_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_s8_i8816gemm_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_u4_i8832gemm_u4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_u4_i8832gemm_u4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_u8_i8816gemm_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_u8_i8816gemm_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_bf16_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_bf16_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_bf16_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_bf16_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_planar_complex_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_planar_complex_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_s8_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_s8_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_u8_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_u8_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16832spgemm_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16832spgemm_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_c1688gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_c1688gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_c1688tf32gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_c1688tf32gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_cgemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_cgemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_d884gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_d884gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_dgemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_dgemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_f16_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_f16_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_f16_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_f16_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_planar_complex_array_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_planar_complex_array_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_planar_complex_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_planar_complex_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_s8_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_s8_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_u8_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_u8_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16832spgemm_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16832spgemm_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_gz884gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_gz884gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_f16_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_f16_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_f16_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_f16_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_grouped.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_grouped.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_planar_complex.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_planar_complex.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_planar_complex_array.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_planar_complex_array.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_s8_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_s8_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_u8_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_u8_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16832spgemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16832spgemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i168128spgemm_s4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i168128spgemm_s4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i168256andgemm_b1.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i168256andgemm_b1.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i168256xorgemm_b1.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i168256xorgemm_b1.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16832gemm_s4_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16832gemm_s4_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16832gemm_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16832gemm_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16832gemm_s8_s4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16832gemm_s8_s4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16832gemm_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16832gemm_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16864gemm_s4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16864gemm_s4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16864gemm_u4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16864gemm_u4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16864spgemm_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16864spgemm_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_bf16_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_bf16_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_bf16_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_bf16_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_f16_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_f16_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_f16_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_f16_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_grouped_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_grouped_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_grouped_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_grouped_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_planar_complex_array_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_planar_complex_array_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_planar_complex_array_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_planar_complex_array_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_planar_complex_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_planar_complex_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_planar_complex_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_planar_complex_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_s8_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_s8_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_s8_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_s8_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_u8_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_u8_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_u8_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_u8_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816tf32spgemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816tf32spgemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16832spgemm_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16832spgemm_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16832spgemm_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16832spgemm_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s1688bf16gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s1688bf16gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s1688f16gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s1688f16gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s1688gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s1688gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s1688gemm_tf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s1688gemm_tf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s1688tf32gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s1688tf32gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s4_i168128spgemm_s4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s4_i168128spgemm_s4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s4_i16864gemm_s4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s4_i16864gemm_s4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s8_i16832gemm_s4_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s8_i16832gemm_s4_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s8_i16832gemm_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s8_i16832gemm_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s8_i16832gemm_s8_s4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s8_i16832gemm_s8_s4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s8_i16864spgemm_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s8_i16864spgemm_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_sgemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_sgemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_tf32_s1688gemm_tf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_tf32_s1688gemm_tf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_u4_i16864gemm_u4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_u4_i16864gemm_u4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_u8_i16832gemm_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_u8_i16832gemm_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_z884gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_z884gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864fastaccumspgemm_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864fastaccumspgemm_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864fastaccumspgemm_e4m3_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864fastaccumspgemm_e4m3_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864fastaccumspgemm_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864fastaccumspgemm_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864fastaccumspgemm_e5m2_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864fastaccumspgemm_e5m2_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864spgemm_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864spgemm_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864spgemm_e4m3_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864spgemm_e4m3_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864spgemm_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864spgemm_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864spgemm_e5m2_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864spgemm_e5m2_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x16gemm_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x16gemm_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x32gemm_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x32gemm_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x32gemm_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x32gemm_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x32spgemm_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x32spgemm_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_d1684gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_d1684gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x16gemm_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x16gemm_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x32gemm_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x32gemm_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x32gemm_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x32gemm_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x32spgemm_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x32spgemm_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x64spgemm_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x64spgemm_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x64spgemm_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x64spgemm_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_gz1684gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_gz1684gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_h64x128x16gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_h64x128x16gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_h64x128x32spgemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_h64x128x32spgemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_i64x128x32gemm_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_i64x128x32gemm_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_i64x128x32gemm_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_i64x128x32gemm_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_i64x128x64spgemm_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_i64x128x64spgemm_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_i64x128x64spgemm_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_i64x128x64spgemm_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x16gemm_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x16gemm_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x16gemm_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x16gemm_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x16spgemm_tf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x16spgemm_tf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x16tf32spgemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x16tf32spgemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32gemm_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32gemm_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32gemm_e4m3_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32gemm_e4m3_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32gemm_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32gemm_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32gemm_e5m2_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32gemm_e5m2_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32spgemm_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32spgemm_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32spgemm_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32spgemm_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x64spgemm_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x64spgemm_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x64spgemm_e4m3_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x64spgemm_e4m3_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x64spgemm_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x64spgemm_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x64spgemm_e5m2_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x64spgemm_e5m2_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x8gemm_tf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x8gemm_tf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x8tf32gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x8tf32gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s8_i64x128x32gemm_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s8_i64x128x32gemm_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s8_i64x128x32gemm_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s8_i64x128x32gemm_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s8_i64x128x64spgemm_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s8_i64x128x64spgemm_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s8_i64x128x64spgemm_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s8_i64x128x64spgemm_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_h64x128x16gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_h64x128x16gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_h64x128x32spgemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_h64x128x32spgemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_i64x128x32gemm_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_i64x128x32gemm_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_i64x128x32gemm_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_i64x128x32gemm_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_i64x128x64spgemm_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_i64x128x64spgemm_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_i64x128x64spgemm_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_i64x128x64spgemm_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x16gemm_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x16gemm_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x16gemm_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x16gemm_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32gemm_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32gemm_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32gemm_e4m3_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32gemm_e4m3_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32gemm_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32gemm_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32gemm_e5m2_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32gemm_e5m2_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32spgemm_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32spgemm_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32spgemm_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32spgemm_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x64spgemm_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x64spgemm_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x64spgemm_e4m3_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x64spgemm_e4m3_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x64spgemm_e5m2.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x64spgemm_e5m2.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x64spgemm_e5m2_e4m3.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x64spgemm_e5m2_e4m3.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_z1684gemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_z1684gemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_cf32_cdgrad_optimized_cf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_cf32_cdgrad_optimized_cf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_cf32_cfprop_optimized_cf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_cf32_cfprop_optimized_cf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_cf32_cwgrad_optimized_cf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_cf32_cwgrad_optimized_cf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_sdgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_sdgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_sfprop_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_sfprop_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_swgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_swgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm60_hfprop_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm60_hfprop_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_f16_s884dgrad_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_f16_s884dgrad_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_f16_s884fprop_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_f16_s884fprop_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_f16_s884wgrad_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_f16_s884wgrad_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_h884dgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_h884dgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_h884fprop_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_h884fprop_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_h884wgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_h884wgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_s884dgrad_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_s884dgrad_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_s884fprop_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_s884fprop_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_s884wgrad_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_s884wgrad_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_cf32_cdgrad_optimized_cf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_cf32_cdgrad_optimized_cf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_cf32_cfprop_optimized_cf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_cf32_cfprop_optimized_cf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_cf32_cwgrad_optimized_cf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_cf32_cwgrad_optimized_cf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_f16_s1688dgrad_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_f16_s1688dgrad_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_f16_s1688fprop_few_channels_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_f16_s1688fprop_few_channels_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_f16_s1688fprop_fixed_channels_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_f16_s1688fprop_fixed_channels_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_f16_s1688fprop_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_f16_s1688fprop_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_f16_s1688wgrad_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_f16_s1688wgrad_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_h1688dgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_h1688dgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_h1688fprop_few_channels.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_h1688fprop_few_channels.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_h1688fprop_fixed_channels.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_h1688fprop_fixed_channels.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_h1688fprop_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_h1688fprop_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_h1688wgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_h1688wgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_i8816fprop_optimized_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_i8816fprop_optimized_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_i8816fprop_optimized_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_i8816fprop_optimized_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_i8832fprop_optimized_s4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_i8832fprop_optimized_s4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_i8832fprop_optimized_u4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_i8832fprop_optimized_u4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s1688dgrad_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s1688dgrad_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s1688fprop_few_channels_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s1688fprop_few_channels_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s1688fprop_fixed_channels_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s1688fprop_fixed_channels_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s1688fprop_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s1688fprop_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s1688wgrad_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s1688wgrad_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s4_i8832fprop_optimized_s4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s4_i8832fprop_optimized_s4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s8_i8816fprop_few_channels_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s8_i8816fprop_few_channels_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s8_i8816fprop_fixed_channels_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s8_i8816fprop_fixed_channels_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s8_i8816fprop_optimized_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s8_i8816fprop_optimized_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_u4_i8832fprop_optimized_u4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_u4_i8832fprop_optimized_u4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_u8_i8816fprop_few_channels_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_u8_i8816fprop_few_channels_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_u8_i8816fprop_fixed_channels_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_u8_i8816fprop_fixed_channels_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_u8_i8816fprop_optimized_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_u8_i8816fprop_optimized_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_bf16_s16816dgrad_optimized_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_bf16_s16816dgrad_optimized_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_bf16_s16816fprop_fixed_channels_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_bf16_s16816fprop_fixed_channels_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_bf16_s16816fprop_optimized_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_bf16_s16816fprop_optimized_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_bf16_s16816wgrad_optimized_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_bf16_s16816wgrad_optimized_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_f16_s16816dgrad_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_f16_s16816dgrad_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_f16_s16816fprop_fixed_channels_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_f16_s16816fprop_fixed_channels_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_f16_s16816fprop_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_f16_s16816fprop_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_f16_s16816wgrad_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_f16_s16816wgrad_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_h16816dgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_h16816dgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_h16816fprop_fixed_channels.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_h16816fprop_fixed_channels.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_h16816fprop_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_h16816fprop_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_h16816wgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_h16816wgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_i16832fprop_optimized_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_i16832fprop_optimized_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_i16832fprop_optimized_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_i16832fprop_optimized_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_i16864fprop_optimized_s4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_i16864fprop_optimized_s4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_i16864fprop_optimized_u4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_i16864fprop_optimized_u4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816dgrad_optimized_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816dgrad_optimized_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816dgrad_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816dgrad_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816fprop_fixed_channels_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816fprop_fixed_channels_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816fprop_fixed_channels_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816fprop_fixed_channels_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816fprop_optimized_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816fprop_optimized_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816fprop_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816fprop_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816wgrad_optimized_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816wgrad_optimized_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816wgrad_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816wgrad_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688bf16dgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688bf16dgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688bf16fprop_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688bf16fprop_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688bf16wgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688bf16wgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688dgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688dgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688dgrad_optimized_tf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688dgrad_optimized_tf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688f16dgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688f16dgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688f16fprop_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688f16fprop_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688f16wgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688f16wgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688fprop_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688fprop_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688fprop_optimized_tf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688fprop_optimized_tf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688tf32dgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688tf32dgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688tf32fprop_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688tf32fprop_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688tf32wgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688tf32wgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688wgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688wgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688wgrad_optimized_tf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688wgrad_optimized_tf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s4_i16864fprop_optimized_s4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s4_i16864fprop_optimized_s4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s8_i16832fprop_few_channels_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s8_i16832fprop_few_channels_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s8_i16832fprop_fixed_channels_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s8_i16832fprop_fixed_channels_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s8_i16832fprop_optimized_s8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s8_i16832fprop_optimized_s8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_sdgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_sdgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_sfprop_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_sfprop_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_swgrad_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_swgrad_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_tf32_s1688dgrad_optimized_tf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_tf32_s1688dgrad_optimized_tf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_tf32_s1688fprop_optimized_tf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_tf32_s1688fprop_optimized_tf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_tf32_s1688wgrad_optimized_tf32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_tf32_s1688wgrad_optimized_tf32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_u4_i16864fprop_optimized_u4.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_u4_i16864fprop_optimized_u4.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_u8_i16832fprop_few_channels_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_u8_i16832fprop_few_channels_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_u8_i16832fprop_fixed_channels_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_u8_i16832fprop_fixed_channels_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_u8_i16832fprop_optimized_u8.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_u8_i16832fprop_optimized_u8.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_bf16_s16816dgrad3d_analytic_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_bf16_s16816dgrad3d_analytic_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_bf16_s16816dgrad3d_optimized_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_bf16_s16816dgrad3d_optimized_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_bf16_s16816fprop3d_optimized_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_bf16_s16816fprop3d_optimized_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_bf16_s16816wgrad3d_optimized_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_bf16_s16816wgrad3d_optimized_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_f16_s16816dgrad3d_analytic_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_f16_s16816dgrad3d_analytic_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_f16_s16816dgrad3d_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_f16_s16816dgrad3d_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_f16_s16816fprop3d_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_f16_s16816fprop3d_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_f16_s16816wgrad3d_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_f16_s16816wgrad3d_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_h16816dgrad3d_analytic.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_h16816dgrad3d_analytic.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_h16816dgrad3d_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_h16816dgrad3d_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_h16816fprop3d_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_h16816fprop3d_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_h16816wgrad3d_optimized.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_h16816wgrad3d_optimized.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816dgrad3d_analytic_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816dgrad3d_analytic_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816dgrad3d_analytic_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816dgrad3d_analytic_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816dgrad3d_optimized_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816dgrad3d_optimized_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816dgrad3d_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816dgrad3d_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816fprop3d_optimized_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816fprop3d_optimized_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816fprop3d_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816fprop3d_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816wgrad3d_optimized_bf16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816wgrad3d_optimized_bf16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816wgrad3d_optimized_f16.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816wgrad3d_optimized_f16.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm90_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm90_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm90_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm90_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm90_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm90_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm90_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm90_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_c1688herk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_c1688herk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_c1688syrk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_c1688syrk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_c1688tf32herk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_c1688tf32herk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_c1688tf32syrk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_c1688tf32syrk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_d884syrk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_d884syrk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_gz884herk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_gz884herk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_gz884syrk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_gz884syrk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_s1688syrk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_s1688syrk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_s1688tf32syrk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_s1688tf32syrk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_z884herk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_z884herk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_z884syrk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_z884syrk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm90_d1684syrk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm90_d1684syrk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm90_gz1684herk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm90_gz1684herk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm90_gz1684syrk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm90_gz1684syrk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm90_z1684herk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm90_z1684herk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm90_z1684syrk.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm90_z1684syrk.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_c1688her2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_c1688her2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_c1688syr2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_c1688syr2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_c1688tf32her2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_c1688tf32her2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_c1688tf32syr2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_c1688tf32syr2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_d884syr2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_d884syr2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_gz884her2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_gz884her2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_gz884syr2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_gz884syr2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_s1688syr2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_s1688syr2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_s1688tf32syr2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_s1688tf32syr2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_z884her2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_z884her2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_z884syr2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_z884syr2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm90_d1684syr2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm90_d1684syr2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm90_gz1684her2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm90_gz1684her2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm90_gz1684syr2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm90_gz1684syr2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm90_z1684her2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm90_z1684her2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm90_z1684syr2k.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm90_z1684syr2k.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_c1688tf32trmm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_c1688tf32trmm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_c1688trmm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_c1688trmm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_d884trmm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_d884trmm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_gz884trmm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_gz884trmm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_s1688tf32trmm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_s1688tf32trmm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_s1688trmm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_s1688trmm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_z884trmm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_z884trmm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm90_d1684trmm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm90_d1684trmm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm90_gz1684trmm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm90_gz1684trmm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm90_z1684trmm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm90_z1684trmm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_c1688hemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_c1688hemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_c1688symm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_c1688symm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_c1688tf32hemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_c1688tf32hemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_c1688tf32symm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_c1688tf32symm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_d884symm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_d884symm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_gz884hemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_gz884hemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_gz884symm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_gz884symm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_s1688symm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_s1688symm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_s1688tf32symm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_s1688tf32symm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_z884hemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_z884hemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_z884symm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_z884symm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm90_d1684symm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm90_d1684symm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm90_gz1684hemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm90_gz1684hemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm90_gz1684symm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm90_gz1684symm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm90_z1684hemm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm90_z1684hemm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm90_z1684symm.so -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm90_z1684symm.a -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/share/info/cutlass/generated_kernels.txt -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/bin/cutlass_profiler -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/test/cutlass/ctest/ctest_profiler/CTestTestfile.ctest_profiler.cmake -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/test/cutlass/CTestTestfile.cmake -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/cmake/NvidiaCutlass/NvidiaCutlassConfig.cmake -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/cmake/NvidiaCutlass/NvidiaCutlassConfigVersion.cmake -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/cmake/NvidiaCutlass/NvidiaCutlassTargets.cmake -- Installing: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/cmake/NvidiaCutlass/NvidiaCutlassTargets-release.cmake + popd ~/build/BUILD/cutlass + rm -rf /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/test + rm -rf /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/share/info + set +x Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/bin/cutlass_profiler Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_cf32_cdgrad_optimized_cf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_cf32_cfprop_optimized_cf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_cf32_cwgrad_optimized_cf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_sdgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_sfprop_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm50_swgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm60_hfprop_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_f16_s884dgrad_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_f16_s884fprop_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_f16_s884wgrad_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_h884dgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_h884fprop_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_h884wgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_s884dgrad_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_s884fprop_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm70_s884wgrad_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_cf32_cdgrad_optimized_cf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_cf32_cfprop_optimized_cf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_cf32_cwgrad_optimized_cf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_f16_s1688dgrad_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_f16_s1688fprop_few_channels_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_f16_s1688fprop_fixed_channels_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_f16_s1688fprop_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_f16_s1688wgrad_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_h1688dgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_h1688fprop_few_channels.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_h1688fprop_fixed_channels.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_h1688fprop_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_h1688wgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_i8816fprop_optimized_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_i8816fprop_optimized_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_i8832fprop_optimized_s4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_i8832fprop_optimized_u4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s1688dgrad_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s1688fprop_few_channels_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s1688fprop_fixed_channels_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s1688fprop_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s1688wgrad_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s4_i8832fprop_optimized_s4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s8_i8816fprop_few_channels_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s8_i8816fprop_fixed_channels_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_s8_i8816fprop_optimized_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_u4_i8832fprop_optimized_u4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_u8_i8816fprop_few_channels_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_u8_i8816fprop_fixed_channels_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm75_u8_i8816fprop_optimized_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_bf16_s16816dgrad_optimized_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_bf16_s16816fprop_fixed_channels_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_bf16_s16816fprop_optimized_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_bf16_s16816wgrad_optimized_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_f16_s16816dgrad_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_f16_s16816fprop_fixed_channels_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_f16_s16816fprop_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_f16_s16816wgrad_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_h16816dgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_h16816fprop_fixed_channels.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_h16816fprop_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_h16816wgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_i16832fprop_optimized_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_i16832fprop_optimized_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_i16864fprop_optimized_s4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_i16864fprop_optimized_u4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816dgrad_optimized_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816dgrad_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816fprop_fixed_channels_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816fprop_fixed_channels_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816fprop_optimized_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816fprop_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816wgrad_optimized_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s16816wgrad_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688bf16dgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688bf16fprop_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688bf16wgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688dgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688dgrad_optimized_tf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688f16dgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688f16fprop_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688f16wgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688fprop_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688fprop_optimized_tf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688tf32dgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688tf32fprop_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688tf32wgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688wgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s1688wgrad_optimized_tf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s4_i16864fprop_optimized_s4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s8_i16832fprop_few_channels_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s8_i16832fprop_fixed_channels_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_s8_i16832fprop_optimized_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_sdgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_sfprop_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_swgrad_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_tf32_s1688dgrad_optimized_tf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_tf32_s1688fprop_optimized_tf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_tf32_s1688wgrad_optimized_tf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_u4_i16864fprop_optimized_u4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_u8_i16832fprop_few_channels_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_u8_i16832fprop_fixed_channels_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm80_u8_i16832fprop_optimized_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv2d_sm90_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_bf16_s16816dgrad3d_analytic_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_bf16_s16816dgrad3d_optimized_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_bf16_s16816fprop3d_optimized_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_bf16_s16816wgrad3d_optimized_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_f16_s16816dgrad3d_analytic_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_f16_s16816dgrad3d_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_f16_s16816fprop3d_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_f16_s16816wgrad3d_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_h16816dgrad3d_analytic.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_h16816dgrad3d_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_h16816fprop3d_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_h16816wgrad3d_optimized.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816dgrad3d_analytic_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816dgrad3d_analytic_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816dgrad3d_optimized_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816dgrad3d_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816fprop3d_optimized_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816fprop3d_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816wgrad3d_optimized_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm80_s16816wgrad3d_optimized_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm90_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm90_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm90_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_conv3d_sm90_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm50_cgemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm50_dgemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm50_sgemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm60_hgemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm61_igemm_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm61_s8_igemm_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_f16_s884gemm_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_f16_s884gemm_planar_complex_array_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_f16_s884gemm_planar_complex_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_h884gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_h884gemm_planar_complex.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_h884gemm_planar_complex_array.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_s884gemm_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_s884gemm_planar_complex_array_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm70_s884gemm_planar_complex_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_f16_s1688gemm_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_f16_s1688gemm_planar_complex_array_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_f16_s1688gemm_planar_complex_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_h1688gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_h1688gemm_planar_complex.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_h1688gemm_planar_complex_array.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_i88128xorgemm_b1.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_i8816gemm_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_i8816gemm_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_i8832gemm_s4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_i8832gemm_u4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_s1688gemm_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_s1688gemm_planar_complex_array_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_s1688gemm_planar_complex_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_s4_i8832gemm_s4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_s8_i8816gemm_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_u4_i8832gemm_u4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm75_u8_i8816gemm_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_bf16_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_bf16_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_planar_complex_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_s8_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16816gemm_u8_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_bf16_s16832spgemm_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_c1688gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_c1688tf32gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_cgemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_d884gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_dgemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_f16_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_f16_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_planar_complex_array_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_planar_complex_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_s8_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16816gemm_u8_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_f16_s16832spgemm_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_gz884gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_f16_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_f16_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_grouped.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_planar_complex.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_planar_complex_array.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_s8_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16816gemm_u8_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_h16832spgemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i168128spgemm_s4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i168256andgemm_b1.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i168256xorgemm_b1.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16832gemm_s4_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16832gemm_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16832gemm_s8_s4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16832gemm_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16864gemm_s4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16864gemm_u4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_i16864spgemm_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_bf16_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_bf16_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_f16_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_f16_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_grouped_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_grouped_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_planar_complex_array_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_planar_complex_array_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_planar_complex_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_planar_complex_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_s8_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_s8_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_u8_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816gemm_u8_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16816tf32spgemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16832spgemm_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s16832spgemm_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s1688bf16gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s1688f16gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s1688gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s1688gemm_tf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s1688tf32gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s4_i168128spgemm_s4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s4_i16864gemm_s4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s8_i16832gemm_s4_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s8_i16832gemm_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s8_i16832gemm_s8_s4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_s8_i16864spgemm_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_sgemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_tf32_s1688gemm_tf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_u4_i16864gemm_u4.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_u8_i16832gemm_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm80_z884gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864fastaccumspgemm_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864fastaccumspgemm_e4m3_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864fastaccumspgemm_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864fastaccumspgemm_e5m2_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864spgemm_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864spgemm_e4m3_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864spgemm_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm89_s16864spgemm_e5m2_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x16gemm_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x32gemm_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x32gemm_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x32spgemm_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_d1684gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x16gemm_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x32gemm_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x32gemm_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x32spgemm_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x64spgemm_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x64spgemm_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_gz1684gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_h64x128x16gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_h64x128x32spgemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_i64x128x32gemm_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_i64x128x32gemm_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_i64x128x64spgemm_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_i64x128x64spgemm_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x16gemm_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x16gemm_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x16spgemm_tf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x16tf32spgemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32gemm_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32gemm_e4m3_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32gemm_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32gemm_e5m2_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32spgemm_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x32spgemm_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x64spgemm_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x64spgemm_e4m3_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x64spgemm_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x64spgemm_e5m2_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x8gemm_tf32.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s64x128x8tf32gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s8_i64x128x32gemm_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s8_i64x128x32gemm_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s8_i64x128x64spgemm_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_s8_i64x128x64spgemm_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_h64x128x16gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_h64x128x32spgemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_i64x128x32gemm_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_i64x128x32gemm_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_i64x128x64spgemm_s8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_i64x128x64spgemm_u8.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x16gemm_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x16gemm_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32gemm_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32gemm_e4m3_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32gemm_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32gemm_e5m2_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32spgemm_bf16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x32spgemm_f16.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x64spgemm_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x64spgemm_e4m3_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x64spgemm_e5m2.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_void_s64x128x64spgemm_e5m2_e4m3.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_gemm_sm90_z1684gemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_c1688her2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_c1688syr2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_c1688tf32her2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_c1688tf32syr2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_d884syr2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_gz884her2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_gz884syr2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_s1688syr2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_s1688tf32syr2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_z884her2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm80_z884syr2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm90_d1684syr2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm90_gz1684her2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm90_gz1684syr2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm90_z1684her2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_2k_sm90_z1684syr2k.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_c1688herk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_c1688syrk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_c1688tf32herk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_c1688tf32syrk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_d884syrk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_gz884herk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_gz884syrk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_s1688syrk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_s1688tf32syrk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_z884herk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm80_z884syrk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm90_d1684syrk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm90_gz1684herk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm90_gz1684syrk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm90_z1684herk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_rank_k_sm90_z1684syrk.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_c1688hemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_c1688symm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_c1688tf32hemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_c1688tf32symm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_d884symm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_gz884hemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_gz884symm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_s1688symm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_s1688tf32symm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_z884hemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm80_z884symm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm90_d1684symm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm90_gz1684hemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm90_gz1684symm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm90_z1684hemm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_symm_sm90_z1684symm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_c1688tf32trmm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_c1688trmm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_d884trmm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_gz884trmm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_s1688tf32trmm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_s1688trmm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm80_z884trmm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm90_d1684trmm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm90_gz1684trmm.so Stripping: /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/lib64/libcutlass_trmm_sm90_z1684trmm.so + /usr/lib/rpm/check-buildroot + /usr/lib/rpm/redhat/brp-ldconfig + /usr/lib/rpm/brp-compress + /usr/lib/rpm/brp-strip /usr/bin/strip + /usr/lib/rpm/brp-strip-comment-note /usr/bin/strip /usr/bin/objdump + /usr/lib/rpm/redhat/brp-strip-lto /usr/bin/strip + /usr/lib/rpm/brp-strip-static-archive /usr/bin/strip + /usr/lib/rpm/check-rpaths + /usr/lib/rpm/redhat/brp-mangle-shebangs + /usr/lib/rpm/brp-remove-la-files + env /usr/lib/rpm/redhat/brp-python-bytecompile '' 1 0 -j4 + /usr/lib/rpm/redhat/brp-python-hardlink Processing files: cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64 Executing(%doc): /bin/sh -e /var/tmp/rpm-tmp.u4DA43 + umask 022 + cd /builddir/build/BUILD + cd cutlass + DOCDIR=/builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/share/doc/cutlass + export LC_ALL= + LC_ALL= + export DOCDIR + /usr/bin/mkdir -p /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/share/doc/cutlass + cp -pr /builddir/build/BUILD/cutlass/README.md /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/share/doc/cutlass + cp -pr /builddir/build/BUILD/cutlass/docs /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/share/doc/cutlass + RPM_EC=0 ++ jobs -p + exit 0 Executing(%license): /bin/sh -e /var/tmp/rpm-tmp.YNEoDF + umask 022 + cd /builddir/build/BUILD + cd cutlass + LICENSEDIR=/builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/share/licenses/cutlass + export LC_ALL= + LC_ALL= + export LICENSEDIR + /usr/bin/mkdir -p /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/share/licenses/cutlass + cp -pr /builddir/build/BUILD/cutlass/LICENSE.txt /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64/usr/share/licenses/cutlass + RPM_EC=0 ++ jobs -p + exit 0 Provides: cutlass = 3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39 cutlass(aarch-64) = 3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39 libcutlass.so()(64bit) libcutlass_conv2d_sm50_cf32_cdgrad_optimized_cf32.so()(64bit) libcutlass_conv2d_sm50_cf32_cfprop_optimized_cf32.so()(64bit) libcutlass_conv2d_sm50_cf32_cwgrad_optimized_cf32.so()(64bit) libcutlass_conv2d_sm50_sdgrad_optimized.so()(64bit) libcutlass_conv2d_sm50_sfprop_optimized.so()(64bit) libcutlass_conv2d_sm50_swgrad_optimized.so()(64bit) libcutlass_conv2d_sm60_hfprop_optimized.so()(64bit) libcutlass_conv2d_sm70_f16_s884dgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm70_f16_s884fprop_optimized_f16.so()(64bit) libcutlass_conv2d_sm70_f16_s884wgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm70_h884dgrad_optimized.so()(64bit) libcutlass_conv2d_sm70_h884fprop_optimized.so()(64bit) libcutlass_conv2d_sm70_h884wgrad_optimized.so()(64bit) libcutlass_conv2d_sm70_s884dgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm70_s884fprop_optimized_f16.so()(64bit) libcutlass_conv2d_sm70_s884wgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm75_cf32_cdgrad_optimized_cf32.so()(64bit) libcutlass_conv2d_sm75_cf32_cfprop_optimized_cf32.so()(64bit) libcutlass_conv2d_sm75_cf32_cwgrad_optimized_cf32.so()(64bit) libcutlass_conv2d_sm75_f16_s1688dgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm75_f16_s1688fprop_few_channels_f16.so()(64bit) libcutlass_conv2d_sm75_f16_s1688fprop_fixed_channels_f16.so()(64bit) libcutlass_conv2d_sm75_f16_s1688fprop_optimized_f16.so()(64bit) libcutlass_conv2d_sm75_f16_s1688wgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm75_h1688dgrad_optimized.so()(64bit) libcutlass_conv2d_sm75_h1688fprop_few_channels.so()(64bit) libcutlass_conv2d_sm75_h1688fprop_fixed_channels.so()(64bit) libcutlass_conv2d_sm75_h1688fprop_optimized.so()(64bit) libcutlass_conv2d_sm75_h1688wgrad_optimized.so()(64bit) libcutlass_conv2d_sm75_i8816fprop_optimized_s8.so()(64bit) libcutlass_conv2d_sm75_i8816fprop_optimized_u8.so()(64bit) libcutlass_conv2d_sm75_i8832fprop_optimized_s4.so()(64bit) libcutlass_conv2d_sm75_i8832fprop_optimized_u4.so()(64bit) libcutlass_conv2d_sm75_s1688dgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm75_s1688fprop_few_channels_f16.so()(64bit) libcutlass_conv2d_sm75_s1688fprop_fixed_channels_f16.so()(64bit) libcutlass_conv2d_sm75_s1688fprop_optimized_f16.so()(64bit) libcutlass_conv2d_sm75_s1688wgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm75_s4_i8832fprop_optimized_s4.so()(64bit) libcutlass_conv2d_sm75_s8_i8816fprop_few_channels_s8.so()(64bit) libcutlass_conv2d_sm75_s8_i8816fprop_fixed_channels_s8.so()(64bit) libcutlass_conv2d_sm75_s8_i8816fprop_optimized_s8.so()(64bit) libcutlass_conv2d_sm75_u4_i8832fprop_optimized_u4.so()(64bit) libcutlass_conv2d_sm75_u8_i8816fprop_few_channels_u8.so()(64bit) libcutlass_conv2d_sm75_u8_i8816fprop_fixed_channels_u8.so()(64bit) libcutlass_conv2d_sm75_u8_i8816fprop_optimized_u8.so()(64bit) libcutlass_conv2d_sm80_bf16_s16816dgrad_optimized_bf16.so()(64bit) libcutlass_conv2d_sm80_bf16_s16816fprop_fixed_channels_bf16.so()(64bit) libcutlass_conv2d_sm80_bf16_s16816fprop_optimized_bf16.so()(64bit) libcutlass_conv2d_sm80_bf16_s16816wgrad_optimized_bf16.so()(64bit) libcutlass_conv2d_sm80_f16_s16816dgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm80_f16_s16816fprop_fixed_channels_f16.so()(64bit) libcutlass_conv2d_sm80_f16_s16816fprop_optimized_f16.so()(64bit) libcutlass_conv2d_sm80_f16_s16816wgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm80_h16816dgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_h16816fprop_fixed_channels.so()(64bit) libcutlass_conv2d_sm80_h16816fprop_optimized.so()(64bit) libcutlass_conv2d_sm80_h16816wgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_i16832fprop_optimized_s8.so()(64bit) libcutlass_conv2d_sm80_i16832fprop_optimized_u8.so()(64bit) libcutlass_conv2d_sm80_i16864fprop_optimized_s4.so()(64bit) libcutlass_conv2d_sm80_i16864fprop_optimized_u4.so()(64bit) libcutlass_conv2d_sm80_s16816dgrad_optimized_bf16.so()(64bit) libcutlass_conv2d_sm80_s16816dgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm80_s16816fprop_fixed_channels_bf16.so()(64bit) libcutlass_conv2d_sm80_s16816fprop_fixed_channels_f16.so()(64bit) libcutlass_conv2d_sm80_s16816fprop_optimized_bf16.so()(64bit) libcutlass_conv2d_sm80_s16816fprop_optimized_f16.so()(64bit) libcutlass_conv2d_sm80_s16816wgrad_optimized_bf16.so()(64bit) libcutlass_conv2d_sm80_s16816wgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm80_s1688bf16dgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688bf16fprop_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688bf16wgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688dgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688dgrad_optimized_tf32.so()(64bit) libcutlass_conv2d_sm80_s1688f16dgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688f16fprop_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688f16wgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688fprop_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688fprop_optimized_tf32.so()(64bit) libcutlass_conv2d_sm80_s1688tf32dgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688tf32fprop_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688tf32wgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688wgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688wgrad_optimized_tf32.so()(64bit) libcutlass_conv2d_sm80_s4_i16864fprop_optimized_s4.so()(64bit) libcutlass_conv2d_sm80_s8_i16832fprop_few_channels_s8.so()(64bit) libcutlass_conv2d_sm80_s8_i16832fprop_fixed_channels_s8.so()(64bit) libcutlass_conv2d_sm80_s8_i16832fprop_optimized_s8.so()(64bit) libcutlass_conv2d_sm80_sdgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_sfprop_optimized.so()(64bit) libcutlass_conv2d_sm80_swgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_tf32_s1688dgrad_optimized_tf32.so()(64bit) libcutlass_conv2d_sm80_tf32_s1688fprop_optimized_tf32.so()(64bit) libcutlass_conv2d_sm80_tf32_s1688wgrad_optimized_tf32.so()(64bit) libcutlass_conv2d_sm80_u4_i16864fprop_optimized_u4.so()(64bit) libcutlass_conv2d_sm80_u8_i16832fprop_few_channels_u8.so()(64bit) libcutlass_conv2d_sm80_u8_i16832fprop_fixed_channels_u8.so()(64bit) libcutlass_conv2d_sm80_u8_i16832fprop_optimized_u8.so()(64bit) libcutlass_conv2d_sm90_f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32.so()(64bit) libcutlass_conv2d_sm90_s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32.so()(64bit) libcutlass_conv2d_sm90_s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32.so()(64bit) libcutlass_conv2d_sm90_s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32.so()(64bit) libcutlass_conv2d_sm90_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32.so()(64bit) libcutlass_conv3d_sm80_bf16_s16816dgrad3d_analytic_bf16.so()(64bit) libcutlass_conv3d_sm80_bf16_s16816dgrad3d_optimized_bf16.so()(64bit) libcutlass_conv3d_sm80_bf16_s16816fprop3d_optimized_bf16.so()(64bit) libcutlass_conv3d_sm80_bf16_s16816wgrad3d_optimized_bf16.so()(64bit) libcutlass_conv3d_sm80_f16_s16816dgrad3d_analytic_f16.so()(64bit) libcutlass_conv3d_sm80_f16_s16816dgrad3d_optimized_f16.so()(64bit) libcutlass_conv3d_sm80_f16_s16816fprop3d_optimized_f16.so()(64bit) libcutlass_conv3d_sm80_f16_s16816wgrad3d_optimized_f16.so()(64bit) libcutlass_conv3d_sm80_h16816dgrad3d_analytic.so()(64bit) libcutlass_conv3d_sm80_h16816dgrad3d_optimized.so()(64bit) libcutlass_conv3d_sm80_h16816fprop3d_optimized.so()(64bit) libcutlass_conv3d_sm80_h16816wgrad3d_optimized.so()(64bit) libcutlass_conv3d_sm80_s16816dgrad3d_analytic_bf16.so()(64bit) libcutlass_conv3d_sm80_s16816dgrad3d_analytic_f16.so()(64bit) libcutlass_conv3d_sm80_s16816dgrad3d_optimized_bf16.so()(64bit) libcutlass_conv3d_sm80_s16816dgrad3d_optimized_f16.so()(64bit) libcutlass_conv3d_sm80_s16816fprop3d_optimized_bf16.so()(64bit) libcutlass_conv3d_sm80_s16816fprop3d_optimized_f16.so()(64bit) libcutlass_conv3d_sm80_s16816wgrad3d_optimized_bf16.so()(64bit) libcutlass_conv3d_sm80_s16816wgrad3d_optimized_f16.so()(64bit) libcutlass_conv3d_sm90_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32.so()(64bit) libcutlass_conv3d_sm90_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32.so()(64bit) libcutlass_conv3d_sm90_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32.so()(64bit) libcutlass_conv3d_sm90_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32.so()(64bit) libcutlass_gemm_sm50_cgemm.so()(64bit) libcutlass_gemm_sm50_dgemm.so()(64bit) libcutlass_gemm_sm50_sgemm.so()(64bit) libcutlass_gemm_sm60_hgemm.so()(64bit) libcutlass_gemm_sm61_igemm_s8.so()(64bit) libcutlass_gemm_sm61_s8_igemm_s8.so()(64bit) libcutlass_gemm_sm70_f16_s884gemm_f16.so()(64bit) libcutlass_gemm_sm70_f16_s884gemm_planar_complex_array_f16.so()(64bit) libcutlass_gemm_sm70_f16_s884gemm_planar_complex_f16.so()(64bit) libcutlass_gemm_sm70_h884gemm.so()(64bit) libcutlass_gemm_sm70_h884gemm_planar_complex.so()(64bit) libcutlass_gemm_sm70_h884gemm_planar_complex_array.so()(64bit) libcutlass_gemm_sm70_s884gemm_f16.so()(64bit) libcutlass_gemm_sm70_s884gemm_planar_complex_array_f16.so()(64bit) libcutlass_gemm_sm70_s884gemm_planar_complex_f16.so()(64bit) libcutlass_gemm_sm75_f16_s1688gemm_f16.so()(64bit) libcutlass_gemm_sm75_f16_s1688gemm_planar_complex_array_f16.so()(64bit) libcutlass_gemm_sm75_f16_s1688gemm_planar_complex_f16.so()(64bit) libcutlass_gemm_sm75_h1688gemm.so()(64bit) libcutlass_gemm_sm75_h1688gemm_planar_complex.so()(64bit) libcutlass_gemm_sm75_h1688gemm_planar_complex_array.so()(64bit) libcutlass_gemm_sm75_i88128xorgemm_b1.so()(64bit) libcutlass_gemm_sm75_i8816gemm_s8.so()(64bit) libcutlass_gemm_sm75_i8816gemm_u8.so()(64bit) libcutlass_gemm_sm75_i8832gemm_s4.so()(64bit) libcutlass_gemm_sm75_i8832gemm_u4.so()(64bit) libcutlass_gemm_sm75_s1688gemm_f16.so()(64bit) libcutlass_gemm_sm75_s1688gemm_planar_complex_array_f16.so()(64bit) libcutlass_gemm_sm75_s1688gemm_planar_complex_f16.so()(64bit) libcutlass_gemm_sm75_s4_i8832gemm_s4.so()(64bit) libcutlass_gemm_sm75_s8_i8816gemm_s8.so()(64bit) libcutlass_gemm_sm75_u4_i8832gemm_u4.so()(64bit) libcutlass_gemm_sm75_u8_i8816gemm_u8.so()(64bit) libcutlass_gemm_sm80_bf16_s16816gemm_bf16.so()(64bit) libcutlass_gemm_sm80_bf16_s16816gemm_bf16_s8.so()(64bit) libcutlass_gemm_sm80_bf16_s16816gemm_bf16_u8.so()(64bit) libcutlass_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16.so()(64bit) libcutlass_gemm_sm80_bf16_s16816gemm_planar_complex_bf16.so()(64bit) libcutlass_gemm_sm80_bf16_s16816gemm_s8_bf16.so()(64bit) libcutlass_gemm_sm80_bf16_s16816gemm_u8_bf16.so()(64bit) libcutlass_gemm_sm80_bf16_s16832spgemm_bf16.so()(64bit) libcutlass_gemm_sm80_c1688gemm.so()(64bit) libcutlass_gemm_sm80_c1688tf32gemm.so()(64bit) libcutlass_gemm_sm80_cgemm.so()(64bit) libcutlass_gemm_sm80_d884gemm.so()(64bit) libcutlass_gemm_sm80_dgemm.so()(64bit) libcutlass_gemm_sm80_f16_s16816gemm_f16.so()(64bit) libcutlass_gemm_sm80_f16_s16816gemm_f16_s8.so()(64bit) libcutlass_gemm_sm80_f16_s16816gemm_f16_u8.so()(64bit) libcutlass_gemm_sm80_f16_s16816gemm_planar_complex_array_f16.so()(64bit) libcutlass_gemm_sm80_f16_s16816gemm_planar_complex_f16.so()(64bit) libcutlass_gemm_sm80_f16_s16816gemm_s8_f16.so()(64bit) libcutlass_gemm_sm80_f16_s16816gemm_u8_f16.so()(64bit) libcutlass_gemm_sm80_f16_s16832spgemm_f16.so()(64bit) libcutlass_gemm_sm80_gz884gemm.so()(64bit) libcutlass_gemm_sm80_h16816gemm.so()(64bit) libcutlass_gemm_sm80_h16816gemm_f16_s8.so()(64bit) libcutlass_gemm_sm80_h16816gemm_f16_u8.so()(64bit) libcutlass_gemm_sm80_h16816gemm_grouped.so()(64bit) libcutlass_gemm_sm80_h16816gemm_planar_complex.so()(64bit) libcutlass_gemm_sm80_h16816gemm_planar_complex_array.so()(64bit) libcutlass_gemm_sm80_h16816gemm_s8_f16.so()(64bit) libcutlass_gemm_sm80_h16816gemm_u8_f16.so()(64bit) libcutlass_gemm_sm80_h16832spgemm.so()(64bit) libcutlass_gemm_sm80_i168128spgemm_s4.so()(64bit) libcutlass_gemm_sm80_i168256andgemm_b1.so()(64bit) libcutlass_gemm_sm80_i168256xorgemm_b1.so()(64bit) libcutlass_gemm_sm80_i16832gemm_s4_s8.so()(64bit) libcutlass_gemm_sm80_i16832gemm_s8.so()(64bit) libcutlass_gemm_sm80_i16832gemm_s8_s4.so()(64bit) libcutlass_gemm_sm80_i16832gemm_u8.so()(64bit) libcutlass_gemm_sm80_i16864gemm_s4.so()(64bit) libcutlass_gemm_sm80_i16864gemm_u4.so()(64bit) libcutlass_gemm_sm80_i16864spgemm_s8.so()(64bit) libcutlass_gemm_sm80_s16816gemm_bf16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_bf16_s8.so()(64bit) libcutlass_gemm_sm80_s16816gemm_bf16_u8.so()(64bit) libcutlass_gemm_sm80_s16816gemm_f16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_f16_s8.so()(64bit) libcutlass_gemm_sm80_s16816gemm_f16_u8.so()(64bit) libcutlass_gemm_sm80_s16816gemm_grouped_bf16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_grouped_f16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_planar_complex_array_bf16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_planar_complex_array_f16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_planar_complex_bf16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_planar_complex_f16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_s8_bf16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_s8_f16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_u8_bf16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_u8_f16.so()(64bit) libcutlass_gemm_sm80_s16816tf32spgemm.so()(64bit) libcutlass_gemm_sm80_s16832spgemm_bf16.so()(64bit) libcutlass_gemm_sm80_s16832spgemm_f16.so()(64bit) libcutlass_gemm_sm80_s1688bf16gemm.so()(64bit) libcutlass_gemm_sm80_s1688f16gemm.so()(64bit) libcutlass_gemm_sm80_s1688gemm.so()(64bit) libcutlass_gemm_sm80_s1688gemm_tf32.so()(64bit) libcutlass_gemm_sm80_s1688tf32gemm.so()(64bit) libcutlass_gemm_sm80_s4_i168128spgemm_s4.so()(64bit) libcutlass_gemm_sm80_s4_i16864gemm_s4.so()(64bit) libcutlass_gemm_sm80_s8_i16832gemm_s4_s8.so()(64bit) libcutlass_gemm_sm80_s8_i16832gemm_s8.so()(64bit) libcutlass_gemm_sm80_s8_i16832gemm_s8_s4.so()(64bit) libcutlass_gemm_sm80_s8_i16864spgemm_s8.so()(64bit) libcutlass_gemm_sm80_sgemm.so()(64bit) libcutlass_gemm_sm80_tf32_s1688gemm_tf32.so()(64bit) libcutlass_gemm_sm80_u4_i16864gemm_u4.so()(64bit) libcutlass_gemm_sm80_u8_i16832gemm_u8.so()(64bit) libcutlass_gemm_sm80_z884gemm.so()(64bit) libcutlass_gemm_sm89_s16864fastaccumspgemm_e4m3.so()(64bit) libcutlass_gemm_sm89_s16864fastaccumspgemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm89_s16864fastaccumspgemm_e5m2.so()(64bit) libcutlass_gemm_sm89_s16864fastaccumspgemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm89_s16864spgemm_e4m3.so()(64bit) libcutlass_gemm_sm89_s16864spgemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm89_s16864spgemm_e5m2.so()(64bit) libcutlass_gemm_sm89_s16864spgemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x16gemm_bf16.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x32gemm_e4m3.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x32gemm_e5m2.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x32spgemm_bf16.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e4m3.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e5m2.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_d1684gemm.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x16gemm_f16.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x32gemm_e4m3.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x32gemm_e5m2.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x32spgemm_f16.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x64spgemm_e4m3.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x64spgemm_e5m2.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_gz1684gemm.so()(64bit) libcutlass_gemm_sm90_h64x128x16gemm.so()(64bit) libcutlass_gemm_sm90_h64x128x32spgemm.so()(64bit) libcutlass_gemm_sm90_i64x128x32gemm_s8.so()(64bit) libcutlass_gemm_sm90_i64x128x32gemm_u8.so()(64bit) libcutlass_gemm_sm90_i64x128x64spgemm_s8.so()(64bit) libcutlass_gemm_sm90_i64x128x64spgemm_u8.so()(64bit) libcutlass_gemm_sm90_s64x128x16gemm_bf16.so()(64bit) libcutlass_gemm_sm90_s64x128x16gemm_f16.so()(64bit) libcutlass_gemm_sm90_s64x128x16spgemm_tf32.so()(64bit) libcutlass_gemm_sm90_s64x128x16tf32spgemm.so()(64bit) libcutlass_gemm_sm90_s64x128x32gemm_e4m3.so()(64bit) libcutlass_gemm_sm90_s64x128x32gemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_s64x128x32gemm_e5m2.so()(64bit) libcutlass_gemm_sm90_s64x128x32gemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_s64x128x32spgemm_bf16.so()(64bit) libcutlass_gemm_sm90_s64x128x32spgemm_f16.so()(64bit) libcutlass_gemm_sm90_s64x128x64spgemm_e4m3.so()(64bit) libcutlass_gemm_sm90_s64x128x64spgemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_s64x128x64spgemm_e5m2.so()(64bit) libcutlass_gemm_sm90_s64x128x64spgemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_s64x128x8gemm_tf32.so()(64bit) libcutlass_gemm_sm90_s64x128x8tf32gemm.so()(64bit) libcutlass_gemm_sm90_s8_i64x128x32gemm_s8.so()(64bit) libcutlass_gemm_sm90_s8_i64x128x32gemm_u8.so()(64bit) libcutlass_gemm_sm90_s8_i64x128x64spgemm_s8.so()(64bit) libcutlass_gemm_sm90_s8_i64x128x64spgemm_u8.so()(64bit) libcutlass_gemm_sm90_void_h64x128x16gemm.so()(64bit) libcutlass_gemm_sm90_void_h64x128x32spgemm.so()(64bit) libcutlass_gemm_sm90_void_i64x128x32gemm_s8.so()(64bit) libcutlass_gemm_sm90_void_i64x128x32gemm_u8.so()(64bit) libcutlass_gemm_sm90_void_i64x128x64spgemm_s8.so()(64bit) libcutlass_gemm_sm90_void_i64x128x64spgemm_u8.so()(64bit) libcutlass_gemm_sm90_void_s64x128x16gemm_bf16.so()(64bit) libcutlass_gemm_sm90_void_s64x128x16gemm_f16.so()(64bit) libcutlass_gemm_sm90_void_s64x128x32gemm_e4m3.so()(64bit) libcutlass_gemm_sm90_void_s64x128x32gemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_void_s64x128x32gemm_e5m2.so()(64bit) libcutlass_gemm_sm90_void_s64x128x32gemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_void_s64x128x32spgemm_bf16.so()(64bit) libcutlass_gemm_sm90_void_s64x128x32spgemm_f16.so()(64bit) libcutlass_gemm_sm90_void_s64x128x64spgemm_e4m3.so()(64bit) libcutlass_gemm_sm90_void_s64x128x64spgemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_void_s64x128x64spgemm_e5m2.so()(64bit) libcutlass_gemm_sm90_void_s64x128x64spgemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_z1684gemm.so()(64bit) libcutlass_rank_2k_sm80_c1688her2k.so()(64bit) libcutlass_rank_2k_sm80_c1688syr2k.so()(64bit) libcutlass_rank_2k_sm80_c1688tf32her2k.so()(64bit) libcutlass_rank_2k_sm80_c1688tf32syr2k.so()(64bit) libcutlass_rank_2k_sm80_d884syr2k.so()(64bit) libcutlass_rank_2k_sm80_gz884her2k.so()(64bit) libcutlass_rank_2k_sm80_gz884syr2k.so()(64bit) libcutlass_rank_2k_sm80_s1688syr2k.so()(64bit) libcutlass_rank_2k_sm80_s1688tf32syr2k.so()(64bit) libcutlass_rank_2k_sm80_z884her2k.so()(64bit) libcutlass_rank_2k_sm80_z884syr2k.so()(64bit) libcutlass_rank_2k_sm90_d1684syr2k.so()(64bit) libcutlass_rank_2k_sm90_gz1684her2k.so()(64bit) libcutlass_rank_2k_sm90_gz1684syr2k.so()(64bit) libcutlass_rank_2k_sm90_z1684her2k.so()(64bit) libcutlass_rank_2k_sm90_z1684syr2k.so()(64bit) libcutlass_rank_k_sm80_c1688herk.so()(64bit) libcutlass_rank_k_sm80_c1688syrk.so()(64bit) libcutlass_rank_k_sm80_c1688tf32herk.so()(64bit) libcutlass_rank_k_sm80_c1688tf32syrk.so()(64bit) libcutlass_rank_k_sm80_d884syrk.so()(64bit) libcutlass_rank_k_sm80_gz884herk.so()(64bit) libcutlass_rank_k_sm80_gz884syrk.so()(64bit) libcutlass_rank_k_sm80_s1688syrk.so()(64bit) libcutlass_rank_k_sm80_s1688tf32syrk.so()(64bit) libcutlass_rank_k_sm80_z884herk.so()(64bit) libcutlass_rank_k_sm80_z884syrk.so()(64bit) libcutlass_rank_k_sm90_d1684syrk.so()(64bit) libcutlass_rank_k_sm90_gz1684herk.so()(64bit) libcutlass_rank_k_sm90_gz1684syrk.so()(64bit) libcutlass_rank_k_sm90_z1684herk.so()(64bit) libcutlass_rank_k_sm90_z1684syrk.so()(64bit) libcutlass_symm_sm80_c1688hemm.so()(64bit) libcutlass_symm_sm80_c1688symm.so()(64bit) libcutlass_symm_sm80_c1688tf32hemm.so()(64bit) libcutlass_symm_sm80_c1688tf32symm.so()(64bit) libcutlass_symm_sm80_d884symm.so()(64bit) libcutlass_symm_sm80_gz884hemm.so()(64bit) libcutlass_symm_sm80_gz884symm.so()(64bit) libcutlass_symm_sm80_s1688symm.so()(64bit) libcutlass_symm_sm80_s1688tf32symm.so()(64bit) libcutlass_symm_sm80_z884hemm.so()(64bit) libcutlass_symm_sm80_z884symm.so()(64bit) libcutlass_symm_sm90_d1684symm.so()(64bit) libcutlass_symm_sm90_gz1684hemm.so()(64bit) libcutlass_symm_sm90_gz1684symm.so()(64bit) libcutlass_symm_sm90_z1684hemm.so()(64bit) libcutlass_symm_sm90_z1684symm.so()(64bit) libcutlass_trmm_sm80_c1688tf32trmm.so()(64bit) libcutlass_trmm_sm80_c1688trmm.so()(64bit) libcutlass_trmm_sm80_d884trmm.so()(64bit) libcutlass_trmm_sm80_gz884trmm.so()(64bit) libcutlass_trmm_sm80_s1688tf32trmm.so()(64bit) libcutlass_trmm_sm80_s1688trmm.so()(64bit) libcutlass_trmm_sm80_z884trmm.so()(64bit) libcutlass_trmm_sm90_d1684trmm.so()(64bit) libcutlass_trmm_sm90_gz1684trmm.so()(64bit) libcutlass_trmm_sm90_z1684trmm.so()(64bit) Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: libc.so.6()(64bit) libc.so.6(GLIBC_2.17)(64bit) libc.so.6(GLIBC_2.34)(64bit) libcuda.so.1()(64bit) libcudart.so.12()(64bit) libcudart.so.12(libcudart.so.12)(64bit) libcutlass.so()(64bit) libcutlass_conv2d_sm50_cf32_cdgrad_optimized_cf32.so()(64bit) libcutlass_conv2d_sm50_cf32_cfprop_optimized_cf32.so()(64bit) libcutlass_conv2d_sm50_cf32_cwgrad_optimized_cf32.so()(64bit) libcutlass_conv2d_sm50_sdgrad_optimized.so()(64bit) libcutlass_conv2d_sm50_sfprop_optimized.so()(64bit) libcutlass_conv2d_sm50_swgrad_optimized.so()(64bit) libcutlass_conv2d_sm60_hfprop_optimized.so()(64bit) libcutlass_conv2d_sm70_f16_s884dgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm70_f16_s884fprop_optimized_f16.so()(64bit) libcutlass_conv2d_sm70_f16_s884wgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm70_h884dgrad_optimized.so()(64bit) libcutlass_conv2d_sm70_h884fprop_optimized.so()(64bit) libcutlass_conv2d_sm70_h884wgrad_optimized.so()(64bit) libcutlass_conv2d_sm70_s884dgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm70_s884fprop_optimized_f16.so()(64bit) libcutlass_conv2d_sm70_s884wgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm75_cf32_cdgrad_optimized_cf32.so()(64bit) libcutlass_conv2d_sm75_cf32_cfprop_optimized_cf32.so()(64bit) libcutlass_conv2d_sm75_cf32_cwgrad_optimized_cf32.so()(64bit) libcutlass_conv2d_sm75_f16_s1688dgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm75_f16_s1688fprop_few_channels_f16.so()(64bit) libcutlass_conv2d_sm75_f16_s1688fprop_fixed_channels_f16.so()(64bit) libcutlass_conv2d_sm75_f16_s1688fprop_optimized_f16.so()(64bit) libcutlass_conv2d_sm75_f16_s1688wgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm75_h1688dgrad_optimized.so()(64bit) libcutlass_conv2d_sm75_h1688fprop_few_channels.so()(64bit) libcutlass_conv2d_sm75_h1688fprop_fixed_channels.so()(64bit) libcutlass_conv2d_sm75_h1688fprop_optimized.so()(64bit) libcutlass_conv2d_sm75_h1688wgrad_optimized.so()(64bit) libcutlass_conv2d_sm75_i8816fprop_optimized_s8.so()(64bit) libcutlass_conv2d_sm75_i8816fprop_optimized_u8.so()(64bit) libcutlass_conv2d_sm75_i8832fprop_optimized_s4.so()(64bit) libcutlass_conv2d_sm75_i8832fprop_optimized_u4.so()(64bit) libcutlass_conv2d_sm75_s1688dgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm75_s1688fprop_few_channels_f16.so()(64bit) libcutlass_conv2d_sm75_s1688fprop_fixed_channels_f16.so()(64bit) libcutlass_conv2d_sm75_s1688fprop_optimized_f16.so()(64bit) libcutlass_conv2d_sm75_s1688wgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm75_s4_i8832fprop_optimized_s4.so()(64bit) libcutlass_conv2d_sm75_s8_i8816fprop_few_channels_s8.so()(64bit) libcutlass_conv2d_sm75_s8_i8816fprop_fixed_channels_s8.so()(64bit) libcutlass_conv2d_sm75_s8_i8816fprop_optimized_s8.so()(64bit) libcutlass_conv2d_sm75_u4_i8832fprop_optimized_u4.so()(64bit) libcutlass_conv2d_sm75_u8_i8816fprop_few_channels_u8.so()(64bit) libcutlass_conv2d_sm75_u8_i8816fprop_fixed_channels_u8.so()(64bit) libcutlass_conv2d_sm75_u8_i8816fprop_optimized_u8.so()(64bit) libcutlass_conv2d_sm80_bf16_s16816dgrad_optimized_bf16.so()(64bit) libcutlass_conv2d_sm80_bf16_s16816fprop_fixed_channels_bf16.so()(64bit) libcutlass_conv2d_sm80_bf16_s16816fprop_optimized_bf16.so()(64bit) libcutlass_conv2d_sm80_bf16_s16816wgrad_optimized_bf16.so()(64bit) libcutlass_conv2d_sm80_f16_s16816dgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm80_f16_s16816fprop_fixed_channels_f16.so()(64bit) libcutlass_conv2d_sm80_f16_s16816fprop_optimized_f16.so()(64bit) libcutlass_conv2d_sm80_f16_s16816wgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm80_h16816dgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_h16816fprop_fixed_channels.so()(64bit) libcutlass_conv2d_sm80_h16816fprop_optimized.so()(64bit) libcutlass_conv2d_sm80_h16816wgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_i16832fprop_optimized_s8.so()(64bit) libcutlass_conv2d_sm80_i16832fprop_optimized_u8.so()(64bit) libcutlass_conv2d_sm80_i16864fprop_optimized_s4.so()(64bit) libcutlass_conv2d_sm80_i16864fprop_optimized_u4.so()(64bit) libcutlass_conv2d_sm80_s16816dgrad_optimized_bf16.so()(64bit) libcutlass_conv2d_sm80_s16816dgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm80_s16816fprop_fixed_channels_bf16.so()(64bit) libcutlass_conv2d_sm80_s16816fprop_fixed_channels_f16.so()(64bit) libcutlass_conv2d_sm80_s16816fprop_optimized_bf16.so()(64bit) libcutlass_conv2d_sm80_s16816fprop_optimized_f16.so()(64bit) libcutlass_conv2d_sm80_s16816wgrad_optimized_bf16.so()(64bit) libcutlass_conv2d_sm80_s16816wgrad_optimized_f16.so()(64bit) libcutlass_conv2d_sm80_s1688bf16dgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688bf16fprop_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688bf16wgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688dgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688dgrad_optimized_tf32.so()(64bit) libcutlass_conv2d_sm80_s1688f16dgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688f16fprop_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688f16wgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688fprop_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688fprop_optimized_tf32.so()(64bit) libcutlass_conv2d_sm80_s1688tf32dgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688tf32fprop_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688tf32wgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688wgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_s1688wgrad_optimized_tf32.so()(64bit) libcutlass_conv2d_sm80_s4_i16864fprop_optimized_s4.so()(64bit) libcutlass_conv2d_sm80_s8_i16832fprop_few_channels_s8.so()(64bit) libcutlass_conv2d_sm80_s8_i16832fprop_fixed_channels_s8.so()(64bit) libcutlass_conv2d_sm80_s8_i16832fprop_optimized_s8.so()(64bit) libcutlass_conv2d_sm80_sdgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_sfprop_optimized.so()(64bit) libcutlass_conv2d_sm80_swgrad_optimized.so()(64bit) libcutlass_conv2d_sm80_tf32_s1688dgrad_optimized_tf32.so()(64bit) libcutlass_conv2d_sm80_tf32_s1688fprop_optimized_tf32.so()(64bit) libcutlass_conv2d_sm80_tf32_s1688wgrad_optimized_tf32.so()(64bit) libcutlass_conv2d_sm80_u4_i16864fprop_optimized_u4.so()(64bit) libcutlass_conv2d_sm80_u8_i16832fprop_few_channels_u8.so()(64bit) libcutlass_conv2d_sm80_u8_i16832fprop_fixed_channels_u8.so()(64bit) libcutlass_conv2d_sm80_u8_i16832fprop_optimized_u8.so()(64bit) libcutlass_conv2d_sm90_f16128x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x192x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x192x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16128x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x96x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f16256x96x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x128x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x128x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x128x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x128x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x256x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x256x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x256x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x256x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x64x16dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x64x16fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x64x8dgrad_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f1664x64x8fprop_f16nhwc_f16nhwc_f16_f16_f16.so()(64bit) libcutlass_conv2d_sm90_f32128x192x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32128x256x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32128x256x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32128x256x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32128x256x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32128x256x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32256x128x16dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32256x128x16fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32256x128x8dgrad_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32256x128x8fprop_bf16nhwc_bf16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32256x128x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f32256x96x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f3264x64x16dgrad_f16nhwc_f16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f3264x64x16fprop_f16nhwc_f16nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_f3264x64x8fprop_f32nhwc_f32nhwc_f32_f32_f32.so()(64bit) libcutlass_conv2d_sm90_s32128x256x16fprop_s8nhwc_s8nhwc_s32_s32_s32.so()(64bit) libcutlass_conv2d_sm90_s32128x256x32fprop_s8nhwc_s8nhwc_s32_s32_s32.so()(64bit) libcutlass_conv2d_sm90_s32256x128x16fprop_s8nhwc_s8nhwc_s32_s32_s32.so()(64bit) libcutlass_conv2d_sm90_s32256x128x32fprop_s8nhwc_s8nhwc_s32_s32_s32.so()(64bit) libcutlass_conv2d_sm90_s3264x64x16fprop_s8nhwc_s8nhwc_s32_s32_s32.so()(64bit) libcutlass_conv3d_sm80_bf16_s16816dgrad3d_analytic_bf16.so()(64bit) libcutlass_conv3d_sm80_bf16_s16816dgrad3d_optimized_bf16.so()(64bit) libcutlass_conv3d_sm80_bf16_s16816fprop3d_optimized_bf16.so()(64bit) libcutlass_conv3d_sm80_bf16_s16816wgrad3d_optimized_bf16.so()(64bit) libcutlass_conv3d_sm80_f16_s16816dgrad3d_analytic_f16.so()(64bit) libcutlass_conv3d_sm80_f16_s16816dgrad3d_optimized_f16.so()(64bit) libcutlass_conv3d_sm80_f16_s16816fprop3d_optimized_f16.so()(64bit) libcutlass_conv3d_sm80_f16_s16816wgrad3d_optimized_f16.so()(64bit) libcutlass_conv3d_sm80_h16816dgrad3d_analytic.so()(64bit) libcutlass_conv3d_sm80_h16816dgrad3d_optimized.so()(64bit) libcutlass_conv3d_sm80_h16816fprop3d_optimized.so()(64bit) libcutlass_conv3d_sm80_h16816wgrad3d_optimized.so()(64bit) libcutlass_conv3d_sm80_s16816dgrad3d_analytic_bf16.so()(64bit) libcutlass_conv3d_sm80_s16816dgrad3d_analytic_f16.so()(64bit) libcutlass_conv3d_sm80_s16816dgrad3d_optimized_bf16.so()(64bit) libcutlass_conv3d_sm80_s16816dgrad3d_optimized_f16.so()(64bit) libcutlass_conv3d_sm80_s16816fprop3d_optimized_bf16.so()(64bit) libcutlass_conv3d_sm80_s16816fprop3d_optimized_f16.so()(64bit) libcutlass_conv3d_sm80_s16816wgrad3d_optimized_bf16.so()(64bit) libcutlass_conv3d_sm80_s16816wgrad3d_optimized_f16.so()(64bit) libcutlass_conv3d_sm90_f3264x64x16dgrad_f16ndhwc_f16ndhwc_f32_f32_f32.so()(64bit) libcutlass_conv3d_sm90_f3264x64x16fprop_f16ndhwc_f16ndhwc_f32_f32_f32.so()(64bit) libcutlass_conv3d_sm90_f3264x64x8fprop_f32ndhwc_f32ndhwc_f32_f32_f32.so()(64bit) libcutlass_conv3d_sm90_s3264x64x16fprop_s8ndhwc_s8ndhwc_s32_s32_s32.so()(64bit) libcutlass_gemm_sm50_cgemm.so()(64bit) libcutlass_gemm_sm50_dgemm.so()(64bit) libcutlass_gemm_sm50_sgemm.so()(64bit) libcutlass_gemm_sm60_hgemm.so()(64bit) libcutlass_gemm_sm61_igemm_s8.so()(64bit) libcutlass_gemm_sm61_s8_igemm_s8.so()(64bit) libcutlass_gemm_sm70_f16_s884gemm_f16.so()(64bit) libcutlass_gemm_sm70_f16_s884gemm_planar_complex_array_f16.so()(64bit) libcutlass_gemm_sm70_f16_s884gemm_planar_complex_f16.so()(64bit) libcutlass_gemm_sm70_h884gemm.so()(64bit) libcutlass_gemm_sm70_h884gemm_planar_complex.so()(64bit) libcutlass_gemm_sm70_h884gemm_planar_complex_array.so()(64bit) libcutlass_gemm_sm70_s884gemm_f16.so()(64bit) libcutlass_gemm_sm70_s884gemm_planar_complex_array_f16.so()(64bit) libcutlass_gemm_sm70_s884gemm_planar_complex_f16.so()(64bit) libcutlass_gemm_sm75_f16_s1688gemm_f16.so()(64bit) libcutlass_gemm_sm75_f16_s1688gemm_planar_complex_array_f16.so()(64bit) libcutlass_gemm_sm75_f16_s1688gemm_planar_complex_f16.so()(64bit) libcutlass_gemm_sm75_h1688gemm.so()(64bit) libcutlass_gemm_sm75_h1688gemm_planar_complex.so()(64bit) libcutlass_gemm_sm75_h1688gemm_planar_complex_array.so()(64bit) libcutlass_gemm_sm75_i88128xorgemm_b1.so()(64bit) libcutlass_gemm_sm75_i8816gemm_s8.so()(64bit) libcutlass_gemm_sm75_i8816gemm_u8.so()(64bit) libcutlass_gemm_sm75_i8832gemm_s4.so()(64bit) libcutlass_gemm_sm75_i8832gemm_u4.so()(64bit) libcutlass_gemm_sm75_s1688gemm_f16.so()(64bit) libcutlass_gemm_sm75_s1688gemm_planar_complex_array_f16.so()(64bit) libcutlass_gemm_sm75_s1688gemm_planar_complex_f16.so()(64bit) libcutlass_gemm_sm75_s4_i8832gemm_s4.so()(64bit) libcutlass_gemm_sm75_s8_i8816gemm_s8.so()(64bit) libcutlass_gemm_sm75_u4_i8832gemm_u4.so()(64bit) libcutlass_gemm_sm75_u8_i8816gemm_u8.so()(64bit) libcutlass_gemm_sm80_bf16_s16816gemm_bf16.so()(64bit) libcutlass_gemm_sm80_bf16_s16816gemm_bf16_s8.so()(64bit) libcutlass_gemm_sm80_bf16_s16816gemm_bf16_u8.so()(64bit) libcutlass_gemm_sm80_bf16_s16816gemm_planar_complex_array_bf16.so()(64bit) libcutlass_gemm_sm80_bf16_s16816gemm_planar_complex_bf16.so()(64bit) libcutlass_gemm_sm80_bf16_s16816gemm_s8_bf16.so()(64bit) libcutlass_gemm_sm80_bf16_s16816gemm_u8_bf16.so()(64bit) libcutlass_gemm_sm80_bf16_s16832spgemm_bf16.so()(64bit) libcutlass_gemm_sm80_c1688gemm.so()(64bit) libcutlass_gemm_sm80_c1688tf32gemm.so()(64bit) libcutlass_gemm_sm80_cgemm.so()(64bit) libcutlass_gemm_sm80_d884gemm.so()(64bit) libcutlass_gemm_sm80_dgemm.so()(64bit) libcutlass_gemm_sm80_f16_s16816gemm_f16.so()(64bit) libcutlass_gemm_sm80_f16_s16816gemm_f16_s8.so()(64bit) libcutlass_gemm_sm80_f16_s16816gemm_f16_u8.so()(64bit) libcutlass_gemm_sm80_f16_s16816gemm_planar_complex_array_f16.so()(64bit) libcutlass_gemm_sm80_f16_s16816gemm_planar_complex_f16.so()(64bit) libcutlass_gemm_sm80_f16_s16816gemm_s8_f16.so()(64bit) libcutlass_gemm_sm80_f16_s16816gemm_u8_f16.so()(64bit) libcutlass_gemm_sm80_f16_s16832spgemm_f16.so()(64bit) libcutlass_gemm_sm80_gz884gemm.so()(64bit) libcutlass_gemm_sm80_h16816gemm.so()(64bit) libcutlass_gemm_sm80_h16816gemm_f16_s8.so()(64bit) libcutlass_gemm_sm80_h16816gemm_f16_u8.so()(64bit) libcutlass_gemm_sm80_h16816gemm_grouped.so()(64bit) libcutlass_gemm_sm80_h16816gemm_planar_complex.so()(64bit) libcutlass_gemm_sm80_h16816gemm_planar_complex_array.so()(64bit) libcutlass_gemm_sm80_h16816gemm_s8_f16.so()(64bit) libcutlass_gemm_sm80_h16816gemm_u8_f16.so()(64bit) libcutlass_gemm_sm80_h16832spgemm.so()(64bit) libcutlass_gemm_sm80_i168128spgemm_s4.so()(64bit) libcutlass_gemm_sm80_i168256andgemm_b1.so()(64bit) libcutlass_gemm_sm80_i168256xorgemm_b1.so()(64bit) libcutlass_gemm_sm80_i16832gemm_s4_s8.so()(64bit) libcutlass_gemm_sm80_i16832gemm_s8.so()(64bit) libcutlass_gemm_sm80_i16832gemm_s8_s4.so()(64bit) libcutlass_gemm_sm80_i16832gemm_u8.so()(64bit) libcutlass_gemm_sm80_i16864gemm_s4.so()(64bit) libcutlass_gemm_sm80_i16864gemm_u4.so()(64bit) libcutlass_gemm_sm80_i16864spgemm_s8.so()(64bit) libcutlass_gemm_sm80_s16816gemm_bf16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_bf16_s8.so()(64bit) libcutlass_gemm_sm80_s16816gemm_bf16_u8.so()(64bit) libcutlass_gemm_sm80_s16816gemm_f16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_f16_s8.so()(64bit) libcutlass_gemm_sm80_s16816gemm_f16_u8.so()(64bit) libcutlass_gemm_sm80_s16816gemm_grouped_bf16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_grouped_f16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_planar_complex_array_bf16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_planar_complex_array_f16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_planar_complex_bf16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_planar_complex_f16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_s8_bf16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_s8_f16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_u8_bf16.so()(64bit) libcutlass_gemm_sm80_s16816gemm_u8_f16.so()(64bit) libcutlass_gemm_sm80_s16816tf32spgemm.so()(64bit) libcutlass_gemm_sm80_s16832spgemm_bf16.so()(64bit) libcutlass_gemm_sm80_s16832spgemm_f16.so()(64bit) libcutlass_gemm_sm80_s1688bf16gemm.so()(64bit) libcutlass_gemm_sm80_s1688f16gemm.so()(64bit) libcutlass_gemm_sm80_s1688gemm.so()(64bit) libcutlass_gemm_sm80_s1688gemm_tf32.so()(64bit) libcutlass_gemm_sm80_s1688tf32gemm.so()(64bit) libcutlass_gemm_sm80_s4_i168128spgemm_s4.so()(64bit) libcutlass_gemm_sm80_s4_i16864gemm_s4.so()(64bit) libcutlass_gemm_sm80_s8_i16832gemm_s4_s8.so()(64bit) libcutlass_gemm_sm80_s8_i16832gemm_s8.so()(64bit) libcutlass_gemm_sm80_s8_i16832gemm_s8_s4.so()(64bit) libcutlass_gemm_sm80_s8_i16864spgemm_s8.so()(64bit) libcutlass_gemm_sm80_sgemm.so()(64bit) libcutlass_gemm_sm80_tf32_s1688gemm_tf32.so()(64bit) libcutlass_gemm_sm80_u4_i16864gemm_u4.so()(64bit) libcutlass_gemm_sm80_u8_i16832gemm_u8.so()(64bit) libcutlass_gemm_sm80_z884gemm.so()(64bit) libcutlass_gemm_sm89_s16864fastaccumspgemm_e4m3.so()(64bit) libcutlass_gemm_sm89_s16864fastaccumspgemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm89_s16864fastaccumspgemm_e5m2.so()(64bit) libcutlass_gemm_sm89_s16864fastaccumspgemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm89_s16864spgemm_e4m3.so()(64bit) libcutlass_gemm_sm89_s16864spgemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm89_s16864spgemm_e5m2.so()(64bit) libcutlass_gemm_sm89_s16864spgemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x16gemm_bf16.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x32gemm_e4m3.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x32gemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x32gemm_e5m2.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x32gemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x32spgemm_bf16.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e4m3.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e5m2.so()(64bit) libcutlass_gemm_sm90_bf16_s64x128x64spgemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_d1684gemm.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x16gemm_f16.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x32gemm_e4m3.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x32gemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x32gemm_e5m2.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x32gemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x32spgemm_f16.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x64spgemm_e4m3.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x64spgemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x64spgemm_e5m2.so()(64bit) libcutlass_gemm_sm90_f16_s64x128x64spgemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_gz1684gemm.so()(64bit) libcutlass_gemm_sm90_h64x128x16gemm.so()(64bit) libcutlass_gemm_sm90_h64x128x32spgemm.so()(64bit) libcutlass_gemm_sm90_i64x128x32gemm_s8.so()(64bit) libcutlass_gemm_sm90_i64x128x32gemm_u8.so()(64bit) libcutlass_gemm_sm90_i64x128x64spgemm_s8.so()(64bit) libcutlass_gemm_sm90_i64x128x64spgemm_u8.so()(64bit) libcutlass_gemm_sm90_s64x128x16gemm_bf16.so()(64bit) libcutlass_gemm_sm90_s64x128x16gemm_f16.so()(64bit) libcutlass_gemm_sm90_s64x128x16spgemm_tf32.so()(64bit) libcutlass_gemm_sm90_s64x128x16tf32spgemm.so()(64bit) libcutlass_gemm_sm90_s64x128x32gemm_e4m3.so()(64bit) libcutlass_gemm_sm90_s64x128x32gemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_s64x128x32gemm_e5m2.so()(64bit) libcutlass_gemm_sm90_s64x128x32gemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_s64x128x32spgemm_bf16.so()(64bit) libcutlass_gemm_sm90_s64x128x32spgemm_f16.so()(64bit) libcutlass_gemm_sm90_s64x128x64spgemm_e4m3.so()(64bit) libcutlass_gemm_sm90_s64x128x64spgemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_s64x128x64spgemm_e5m2.so()(64bit) libcutlass_gemm_sm90_s64x128x64spgemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_s64x128x8gemm_tf32.so()(64bit) libcutlass_gemm_sm90_s64x128x8tf32gemm.so()(64bit) libcutlass_gemm_sm90_s8_i64x128x32gemm_s8.so()(64bit) libcutlass_gemm_sm90_s8_i64x128x32gemm_u8.so()(64bit) libcutlass_gemm_sm90_s8_i64x128x64spgemm_s8.so()(64bit) libcutlass_gemm_sm90_s8_i64x128x64spgemm_u8.so()(64bit) libcutlass_gemm_sm90_void_h64x128x16gemm.so()(64bit) libcutlass_gemm_sm90_void_h64x128x32spgemm.so()(64bit) libcutlass_gemm_sm90_void_i64x128x32gemm_s8.so()(64bit) libcutlass_gemm_sm90_void_i64x128x32gemm_u8.so()(64bit) libcutlass_gemm_sm90_void_i64x128x64spgemm_s8.so()(64bit) libcutlass_gemm_sm90_void_i64x128x64spgemm_u8.so()(64bit) libcutlass_gemm_sm90_void_s64x128x16gemm_bf16.so()(64bit) libcutlass_gemm_sm90_void_s64x128x16gemm_f16.so()(64bit) libcutlass_gemm_sm90_void_s64x128x32gemm_e4m3.so()(64bit) libcutlass_gemm_sm90_void_s64x128x32gemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_void_s64x128x32gemm_e5m2.so()(64bit) libcutlass_gemm_sm90_void_s64x128x32gemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_void_s64x128x32spgemm_bf16.so()(64bit) libcutlass_gemm_sm90_void_s64x128x32spgemm_f16.so()(64bit) libcutlass_gemm_sm90_void_s64x128x64spgemm_e4m3.so()(64bit) libcutlass_gemm_sm90_void_s64x128x64spgemm_e4m3_e5m2.so()(64bit) libcutlass_gemm_sm90_void_s64x128x64spgemm_e5m2.so()(64bit) libcutlass_gemm_sm90_void_s64x128x64spgemm_e5m2_e4m3.so()(64bit) libcutlass_gemm_sm90_z1684gemm.so()(64bit) libcutlass_rank_2k_sm80_c1688her2k.so()(64bit) libcutlass_rank_2k_sm80_c1688syr2k.so()(64bit) libcutlass_rank_2k_sm80_c1688tf32her2k.so()(64bit) libcutlass_rank_2k_sm80_c1688tf32syr2k.so()(64bit) libcutlass_rank_2k_sm80_d884syr2k.so()(64bit) libcutlass_rank_2k_sm80_gz884her2k.so()(64bit) libcutlass_rank_2k_sm80_gz884syr2k.so()(64bit) libcutlass_rank_2k_sm80_s1688syr2k.so()(64bit) libcutlass_rank_2k_sm80_s1688tf32syr2k.so()(64bit) libcutlass_rank_2k_sm80_z884her2k.so()(64bit) libcutlass_rank_2k_sm80_z884syr2k.so()(64bit) libcutlass_rank_2k_sm90_d1684syr2k.so()(64bit) libcutlass_rank_2k_sm90_gz1684her2k.so()(64bit) libcutlass_rank_2k_sm90_gz1684syr2k.so()(64bit) libcutlass_rank_2k_sm90_z1684her2k.so()(64bit) libcutlass_rank_2k_sm90_z1684syr2k.so()(64bit) libcutlass_rank_k_sm80_c1688herk.so()(64bit) libcutlass_rank_k_sm80_c1688syrk.so()(64bit) libcutlass_rank_k_sm80_c1688tf32herk.so()(64bit) libcutlass_rank_k_sm80_c1688tf32syrk.so()(64bit) libcutlass_rank_k_sm80_d884syrk.so()(64bit) libcutlass_rank_k_sm80_gz884herk.so()(64bit) libcutlass_rank_k_sm80_gz884syrk.so()(64bit) libcutlass_rank_k_sm80_s1688syrk.so()(64bit) libcutlass_rank_k_sm80_s1688tf32syrk.so()(64bit) libcutlass_rank_k_sm80_z884herk.so()(64bit) libcutlass_rank_k_sm80_z884syrk.so()(64bit) libcutlass_rank_k_sm90_d1684syrk.so()(64bit) libcutlass_rank_k_sm90_gz1684herk.so()(64bit) libcutlass_rank_k_sm90_gz1684syrk.so()(64bit) libcutlass_rank_k_sm90_z1684herk.so()(64bit) libcutlass_rank_k_sm90_z1684syrk.so()(64bit) libcutlass_symm_sm80_c1688hemm.so()(64bit) libcutlass_symm_sm80_c1688symm.so()(64bit) libcutlass_symm_sm80_c1688tf32hemm.so()(64bit) libcutlass_symm_sm80_c1688tf32symm.so()(64bit) libcutlass_symm_sm80_d884symm.so()(64bit) libcutlass_symm_sm80_gz884hemm.so()(64bit) libcutlass_symm_sm80_gz884symm.so()(64bit) libcutlass_symm_sm80_s1688symm.so()(64bit) libcutlass_symm_sm80_s1688tf32symm.so()(64bit) libcutlass_symm_sm80_z884hemm.so()(64bit) libcutlass_symm_sm80_z884symm.so()(64bit) libcutlass_symm_sm90_d1684symm.so()(64bit) libcutlass_symm_sm90_gz1684hemm.so()(64bit) libcutlass_symm_sm90_gz1684symm.so()(64bit) libcutlass_symm_sm90_z1684hemm.so()(64bit) libcutlass_symm_sm90_z1684symm.so()(64bit) libcutlass_trmm_sm80_c1688tf32trmm.so()(64bit) libcutlass_trmm_sm80_c1688trmm.so()(64bit) libcutlass_trmm_sm80_d884trmm.so()(64bit) libcutlass_trmm_sm80_gz884trmm.so()(64bit) libcutlass_trmm_sm80_s1688tf32trmm.so()(64bit) libcutlass_trmm_sm80_s1688trmm.so()(64bit) libcutlass_trmm_sm80_z884trmm.so()(64bit) libcutlass_trmm_sm90_d1684trmm.so()(64bit) libcutlass_trmm_sm90_gz1684trmm.so()(64bit) libcutlass_trmm_sm90_z1684trmm.so()(64bit) libgcc_s.so.1()(64bit) libgcc_s.so.1(GCC_3.0)(64bit) libm.so.6()(64bit) libm.so.6(GLIBC_2.17)(64bit) libm.so.6(GLIBC_2.29)(64bit) libstdc++.so.6()(64bit) libstdc++.so.6(CXXABI_1.3)(64bit) libstdc++.so.6(CXXABI_1.3.5)(64bit) libstdc++.so.6(CXXABI_1.3.9)(64bit) libstdc++.so.6(GLIBCXX_3.4)(64bit) libstdc++.so.6(GLIBCXX_3.4.11)(64bit) libstdc++.so.6(GLIBCXX_3.4.15)(64bit) libstdc++.so.6(GLIBCXX_3.4.18)(64bit) libstdc++.so.6(GLIBCXX_3.4.20)(64bit) libstdc++.so.6(GLIBCXX_3.4.21)(64bit) libstdc++.so.6(GLIBCXX_3.4.26)(64bit) libstdc++.so.6(GLIBCXX_3.4.29)(64bit) libstdc++.so.6(GLIBCXX_3.4.32)(64bit) libstdc++.so.6(GLIBCXX_3.4.5)(64bit) libstdc++.so.6(GLIBCXX_3.4.9)(64bit) rtld(GNU_HASH) Processing files: cutlass-devel-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64 Provides: cmake(NvidiaCutlass) = 3.6.0 cmake(nvidiacutlass) = 3.6.0 cutlass-devel = 3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39 cutlass-devel(aarch-64) = 3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: cmake-filesystem(aarch-64) Processing files: cutlass-static-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64 Provides: cutlass-static = 3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39 cutlass-static(aarch-64) = 3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Checking for unpackaged file(s): /usr/lib/rpm/check-files /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64 Wrote: /builddir/build/RPMS/cutlass-devel-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64.rpm Wrote: /builddir/build/RPMS/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64.rpm Wrote: /builddir/build/RPMS/cutlass-static-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64.rpm Executing(%clean): /bin/sh -e /var/tmp/rpm-tmp.VylqGh + umask 022 + cd /builddir/build/BUILD + cd cutlass + /usr/bin/rm -rf /builddir/build/BUILDROOT/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.aarch64 + RPM_EC=0 ++ jobs -p + exit 0 Executing(rmbuild): /bin/sh -e /var/tmp/rpm-tmp.AgMg3c + umask 022 + cd /builddir/build/BUILD + rm -rf /builddir/build/BUILD/cutlass-SPECPARTS + rm -rf cutlass cutlass.gemspec + RPM_EC=0 ++ jobs -p + exit 0 Finish: rpmbuild cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.src.rpm Finish: build phase for cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.src.rpm INFO: chroot_scan: 3 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/fedora-39-aarch64-1732846793.454290/root/var/log/dnf.log /var/lib/mock/fedora-39-aarch64-1732846793.454290/root/var/log/dnf.librepo.log /var/lib/mock/fedora-39-aarch64-1732846793.454290/root/var/log/dnf.rpm.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names INFO: Done(/var/lib/copr-rpmbuild/results/cutlass-3.6.0-20241009.0.gitcc3c29a8.cu12_6.fc39.src.rpm) Config(child) 1007 minutes 51 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot Finish: run Running RPMResults tool Package info: { "packages": [ { "name": "cutlass-devel", "epoch": null, "version": "3.6.0", "release": "20241009.0.gitcc3c29a8.cu12_6.fc39", "arch": "aarch64" }, { "name": "cutlass", "epoch": null, "version": "3.6.0", "release": "20241009.0.gitcc3c29a8.cu12_6.fc39", "arch": "src" }, { "name": "cutlass", "epoch": null, "version": "3.6.0", "release": "20241009.0.gitcc3c29a8.cu12_6.fc39", "arch": "aarch64" }, { "name": "cutlass-static", "epoch": null, "version": "3.6.0", "release": "20241009.0.gitcc3c29a8.cu12_6.fc39", "arch": "aarch64" } ] } RPMResults finished